Abstract

Suppose $X = (X_x, x \in \mathbb{Z}^d)$ is a family of i.i.d. variables in some measurable
space, $B_0$ is a bounded set in $\mathbb{R}^d$, and for $t > 1$, $H_t$ is a measure
on $tB_0$ determined by the restriction of $X$ to lattice sites in or adjacent to
$tB_0$. We prove convergence to a white noise process for the random measure
on $B_0$ given by $t^{-d/2}(H_t(tA) - \mathbb{E}H_t(tA))$ for subsets $A$ of $B_0$, as $t$ becomes
large, subject to H satisfying a "stabilization" condition (whereby the effect of 
changing X at a single site x is local) but with no assumptions on the rate of 
decay of correlations. We also give a multivariate central limit theorem for the 
joint distributions of two or more such measures $H_t$, and adapt the result to
measures based on Poisson and binomial point processes. Applications given 
include a white noise limit for the measure which counts clusters of critical 
percolation, a functional central limit theorem for the empirical process of the 
edge lengths of the minimal spanning tree on random points, and central limit 
theorems for the on-line nearest neighbour graph. 
1 Introduction



Several approaches have been developed for proving central limit theorems for ran- 
dom variables which arise as the sum of contributions from points of a Poisson or 
binomial point process in $\mathbb{R}^d$, when each contribution is locally determined in some
sense. These include Stein's method, the method of moments, and a martingale 
method. 

By keeping track of the location of each contribution in $\mathbb{R}^d$, one can often in
a natural way create a random measure, i.e. a random field indexed by subsets
of $\mathbb{R}^d$ or by test functions on $\mathbb{R}^d$. It is of interest to look for multivariate central
limit theorems for such random fields, typically with weak convergence of finite-dimensional
distributions to those of white noise. Multivariate central limit theorems
of this type were recently derived using the method of moments by Baryshnikov and
Yukich [2] and can also be derived using Stein's method [20]. Both of these methods
seem to require, in addition to a 'stabilization' condition which formalizes the locally
determined contributions, a form of exponential decay of spatial correlations.

It is of interest to extend these results to cases which satisfy stabilization but 
are not believed to satisfy exponential decay. These include, for example, measures 
associated with the minimal spanning tree (MST) on a Poisson point process or 
with critical percolation. The martingale method is especially powerful in giving 
central limit theorems for these examples (Kesten and Lee, Zhang [27]). For an
exposition of this method in a general setting, see Penrose [13], Penrose and Yukich
[18]. However, these works do not address the convergence to white noise of random
measures. 

In the present paper we extend the martingale method to give such a convergence 
to white noise of stabilizing random fields indexed by subsets of $\mathbb{R}^d$, and illustrate the
method both for percolation and for the minimal spanning tree. A further multivari- 
ate direction in which we extend the existing literature is by considering convergence 
to a multivariate normal for two or more random fields based on the same underlying 
spatial process; in particular, we shall show that the finite-dimensional distributions 
of the empirical process of the lengths of the MST on random points, suitably scaled 
and centred, converge to those of a certain Gaussian process. 

A further aim of this paper is to treat discrete examples (such as percolation) 
and continuous ones (such as the MST) in a unified manner. In this spirit we shall 
derive our basic general result for Poissonian continuum systems (Theorem 2.2) by
direct application of the basic result for lattice systems (Theorem 2.1), although
some extra work is needed to give the limiting covariances for Theorem 2.2 in a
more explicit form than was done in previous continuum central limit theorems
proved by the martingale method. For this reason, in the continuum we consider
only uniform densities of points over a fixed $d$-dimensional set (denoted $B_0$ in the
sequel), unlike Baryshnikov and Yukich who consider non-uniform densities. It is
likely that with extra work, the martingale-based proof of Theorem 2.2 could be
extended to give multivariate central limit theorems for point processes with non-uniform
densities. For martingale-based proofs of univariate central limit theorems
on non-uniform points, see Lee [11] and Penrose ([15], Section 13.7).

In the continuum, stabilizing random fields are often defined in terms of graphs
$G$ which are themselves stabilizing, i.e., locally determined in a certain sense. Stabilizing
graphs include the MST, $k$-nearest neighbour, and sphere of influence graphs.
Given a stabilizing graph $G$, the theory presented here applies to random fields (indexed
by subsets $A$ of $\mathbb{R}^d$) which count, for example, the number of leaves of $G$ in $A$,
the number of components of $G$ that include vertices in $A$, or the sum of weighted
edge lengths $\phi(|e|)$ with the sum over edges $e$ of $G$ having endpoints in $A$.

One stabilizing graph which has not been considered in previous discussions of 
stabilizing graphs is the on-line nearest neighbour graph, in which random points in 
B are randomly ordered, and each point (except the first) is connected to its nearest 
neighbour amongst its predecessors in the ordering. This graph is of recent interest 
in connection with the modelling of scale-free networks [31 E]. Unlike methods based
on exponential decay, our methods provide central limit theorems for this graph too;
see Section 3.4.
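
The construction of the on-line nearest neighbour graph described above can be sketched in a few lines. This is a minimal illustration and not code from the paper: the function name `online_nearest_neighbour_graph`, the choice of the unit square as the region, and the seed are all assumptions made for the example. Ties between equidistant predecessors (a probability-zero event for random points) are broken by lowest index.

```python
import math
import random


def online_nearest_neighbour_graph(points):
    """Join each point (except the first) to its nearest predecessor.

    `points` is a sequence of coordinate tuples, already in the random order.
    Returns a list of edges (i, j) with j < i, one edge per point after the first.
    """
    edges = []
    for i in range(1, len(points)):
        # nearest neighbour of points[i] among points[0], ..., points[i-1]
        j = min(range(i), key=lambda k: math.dist(points[i], points[k]))
        edges.append((i, j))
    return edges


# Illustrative usage: 200 uniform points in the unit square, randomly ordered.
rng = random.Random(0)
pts = [(rng.random(), rng.random()) for _ in range(200)]
rng.shuffle(pts)
edges = online_nearest_neighbour_graph(pts)
```

Each point contributes exactly one edge to an earlier point, so the result is a tree on the point set. The long edges created by early points are one reason a correlation-decay argument is awkward for this graph.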

The rest of the paper is laid out as follows. The next section contains statements 
of the main general results. Section 3 contains applications of these to percolation,
MST, and nearest-neighbour type graphs. Sections 4 and 5 contain proofs of the
general results.

2 General multivariate central limit theorems 
2.1 Notation used throughout 

Let $d \ge 1$ be an integer and let $\mathbf{0}$ denote the origin of $\mathbb{R}^d$. For $x \in \mathbb{R}^d$, write $|x|$ for
the Euclidean norm of $x$. For $A \subseteq \mathbb{R}^d$, $t \ge 1$, and $y \in \mathbb{R}^d$, let $tA$ denote the scaled
set $\{tx : x \in A\}$, and let $\tau_y(A)$ denote the translated set $\{y + x : x \in A\}$. Let $\partial(A)$
denote the boundary of $A$, that is, the intersection of the closure of $A$ with that of
its complement. If $A$ is (Lebesgue) measurable, write $|A|$ for its Lebesgue measure,
and if $A$ is finite, write $\mathrm{card}(A)$ for the number of elements of $A$. Write $\mathrm{diam}(A)$ for
$\sup\{|x - y| : x \in A, y \in A\}$. Given a sequence of sets $(A_n)_{n \ge 1}$, write $\liminf(A_n)$ for
$\bigcup_{n=1}^{\infty} \bigcap_{m=n}^{\infty} A_m$.

For $z \in \mathbb{Z}^d$ and $\varepsilon > 0$, let $Q_{\varepsilon,z}$ denote the cube $\tau_{\varepsilon z}([-\varepsilon, 0)^d)$. For $x \in \mathbb{R}^d$ and
$r > 0$, let $B_r(x)$ be the closed Euclidean ball of radius $r$ centred at $x$. Let $\rho$ be a
finite constant, satisfying $\rho > \sqrt{d}$ but otherwise arbitrary. For $A \subseteq \mathbb{R}^d$, let $\tilde{A}$ denote
the discretization of $A$ given by

$\tilde{A} := \{z \in \mathbb{Z}^d : B_\rho(z) \cap A \neq \emptyset\}$. (2.1)

The condition $\rho > \sqrt{d}$ ensures that $Q_{\varepsilon,z} \subseteq B_\rho(\varepsilon z)$ for $\varepsilon < 1$.
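
To make the discretization (2.1) concrete, the following sketch enumerates the lattice sites whose closed ball of radius $\rho$ meets a set $A$. It is an illustration under stated assumptions, not part of the paper: $A$ is restricted to a closed axis-aligned box so that the distance from a site to $A$ has a closed form, and the function name `discretize_box` is hypothetical.

```python
import itertools
import math


def discretize_box(lo, hi, rho):
    """Lattice sites z in Z^d whose closed ball B_rho(z) meets the closed box [lo, hi].

    `lo` and `hi` are coordinate tuples of equal length d (assumed lo[i] <= hi[i]).
    For a closed box, B_rho(z) meets the box exactly when dist(z, box) <= rho.
    """
    d = len(lo)
    # Only sites within rho of the box in every coordinate can qualify.
    ranges = [
        range(math.floor(lo[i] - rho), math.ceil(hi[i] + rho) + 1)
        for i in range(d)
    ]
    sites = []
    for z in itertools.product(*ranges):
        # Per-coordinate gap between z and the box (zero if z[i] lies in [lo[i], hi[i]]).
        gap = [max(lo[i] - z[i], 0.0, z[i] - hi[i]) for i in range(d)]
        if math.hypot(*gap) <= rho:
            sites.append(z)
    return sites


# Unit square in d = 2 with rho = 1.5 > sqrt(2).
square_sites = discretize_box((0.0, 0.0), (1.0, 1.0), 1.5)
```

The discretization is deliberately generous: it contains every lattice site in the set and a surrounding layer of sites, which is what "lattice sites in or adjacent to" the region amounts to.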

For $\sigma > 0$, let $\mathcal{N}(0, \sigma^2)$ be the normal probability distribution on $\mathbb{R}$ with density
$f(x) = (2\pi\sigma^2)^{-1/2} \exp(-x^2/(2\sigma^2))$. Also, let $\mathcal{N}(0, 0)$ represent the degenerate
probability distribution on $\mathbb{R}$ consisting of a unit point mass at zero, which we view as a
special case of the normal. Given a nonnegative definite $k \times k$ matrix $\Sigma$, let $\mathcal{N}(0, \Sigma)$
denote the centred multivariate normal distribution with covariance matrix $\Sigma$, i.e.
the distribution of a random $k$-vector $X$ satisfying $a'X \sim \mathcal{N}(0, a'\Sigma a)$ for all deterministic
$k$-vectors $a$ (this definition includes the case when $\Sigma$ is singular). Denote
convergence in probability by $\xrightarrow{P}$, convergence in $p$th moment by $\xrightarrow{L^p}$, convergence
in distribution by $\xrightarrow{\mathcal{D}}$, and denote equality in distribution by $\stackrel{\mathcal{D}}{=}$.

We say a subset of $\mathbb{R}^d$ is Riemann measurable if it has Riemann integrable indicator
function; in other words (see Rudin [22]), we say a subset of $\mathbb{R}^d$ is Riemann
measurable if it is bounded and has Lebesgue-null boundary. Let $\mathcal{R}(\mathbb{R}^d)$ denote
the collection of Riemann measurable subsets of $\mathbb{R}^d$. In the sequel, we shall assume
$B_0$ is a fixed set in $\mathcal{R}(\mathbb{R}^d)$ (so in particular $B_0$ is bounded); we shall also assume
that $|B_0| > 0$, which is equivalent to assuming that $B_0$ has non-empty interior. For
example, $B_0$ could be the $d$-dimensional unit cube. Let $\mathcal{R}(B_0)$ denote the collection
of Riemann measurable subsets of $B_0$.

For $y, z \in \mathbb{Z}^d$, write $y \prec z$ if $y$ precedes $z$ in the lexicographic ordering on $\mathbb{Z}^d$,
and $y \preceq z$ if either $y \prec z$ or $y = z$.

2.2 A central limit theorem for lattice systems 

Let $(E, \mathcal{E}, P_0)$ be an arbitrary probability space. On a suitable probability space
$(\Omega, \mathcal{F}, P)$, let $X = (X_z, z \in \mathbb{Z}^d)$ be a family of independent identically distributed
random elements of $E$, each $X_z$ having distribution $P_0$, indexed by the integer lattice,
and let $X_*$ be a further $E$-valued variable with distribution $P_0$, independent of $X$
(i.e., an independent copy of $X_{\mathbf{0}}$). For existence of such an $(\Omega, \mathcal{F}, P)$ and $X$, see
for example Section 8.7 of Williams [21]. For $y \in \mathbb{Z}^d$, let $\tau_y X$ denote the translated
family of variables $(X_{z+y}, z \in \mathbb{Z}^d)$.

Suppose $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. By a random set function on $B_0$ we mean
a collection $H = (H_t(A) : t \ge 1, A \in \mathcal{R}(B_0))$, where for each $t \ge 1$ and $A \in \mathcal{R}(B_0)$,
$H_t(A)$ is a random variable that is a function of $(X_z, z \in \widetilde{tB_0})$, so that, strictly
speaking, $H_t(A)$ is itself a measurable function from $E^{\widetilde{tB_0}}$ to $\mathbb{R}$. If we wish to
emphasize the dependence on $X$ of the value of $H_t(A)$ we write $H_t(X, A)$ for $H_t(A)$.
In many examples $H_t(\cdot)$ is a (random) measure or outer measure (see, e.g., Durrett
[3]) on Borel subsets of $tB_0$ but we do not need to assume this for the general
result (we restrict attention to Riemann measurable $A$). For $t \ge 1$, $y \in \mathbb{Z}^d$, and
$A \in \mathcal{R}(B_0)$, define

$H_{t,y}(A) = H_{t,y}(X, A) := H_t(\tau_y X, A)$. (2.2)

In [14], a CLT is established under a stabilization condition which says, loosely
speaking, that the effect on a given random set function $H$ of resampling the value
of $X$ at a single site is local. To extend this to central limit theorems for random
fields, we require a modification of the stabilization condition used in [14].

Let $X^{\mathbf{0}}$ be the process $X$ with the value $X_{\mathbf{0}}$ at the origin replaced by the independent
copy $X_*$ of $X_{\mathbf{0}}$, but with the values at all other sites the same (i.e.,
$X^{\mathbf{0}} = (X^{\mathbf{0}}_z, z \in \mathbb{Z}^d)$ is the random field given by $X^{\mathbf{0}}_{\mathbf{0}} = X_*$ and $X^{\mathbf{0}}_z = X_z$ for
$z \in \mathbb{Z}^d \setminus \{\mathbf{0}\}$). Given a random set function $H$, define the increment $\Delta^H_{t,y}(A)$ for
$y \in \mathbb{Z}^d$ and $t \ge 1$, $A \in \mathcal{R}(B_0)$, by

$\Delta^H_{t,y}(A) = H_{t,y}(X, A) - H_{t,y}(X^{\mathbf{0}}, A)$. (2.3)

We consider random set functions $H$ with the property that there exists a random
variable $\Delta^H_\infty$ such that for all $A \in \mathcal{R}(B_0)$, and all $[1, \infty) \times \mathbb{Z}^d$-valued sequences
$(t_n, y_n)_{n \ge 1}$,

$\Delta^H_{t_n,y_n}(A) \xrightarrow{P} \Delta^H_\infty$ if $\liminf_{n \to \infty}(\tau_{y_n}(t_n A)) = \mathbb{R}^d$ (2.4)

and also

$\Delta^H_{t_n,y_n}(A) \xrightarrow{P} 0$ if $\liminf_{n \to \infty}(\tau_{y_n}(t_n(B_0 \setminus A))) = \mathbb{R}^d$. (2.5)

Eqns (2.4) and (2.5) are our stabilization conditions. The second condition (2.5) is
a novel feature of this paper; it was not required for the CLTs presented in [14, 18].
The first condition (2.4) is similar to the stabilization condition in Definition 2.3 of
[14].
We shall require also that there exist $\gamma > 2$ such that the moments condition

$\sup\{\mathbb{E}[|\Delta^H_{t,-y}(A)|^\gamma] : A \in \mathcal{R}(B_0), t \ge 1, y \in \widetilde{tB_0}\} < \infty$ (2.6)

is satisfied. Observe that $\Delta^H_{t,-y}(A)$ is identically zero for $y \in \mathbb{Z}^d \setminus \widetilde{tB_0}$, and therefore
condition (2.6) is equivalent to

$\sup\{\mathbb{E}[|\Delta^H_{t,-y}(A)|^\gamma] : A \in \mathcal{R}(B_0), t \ge 1, y \in \mathbb{Z}^d\} < \infty$. (2.7)

For $y \in \mathbb{Z}^d$, let $\mathcal{F}_y$ be the $\sigma$-field generated by $(X_z, z \preceq y)$ (recall that $\preceq$ denotes
the lexicographic ordering on $\mathbb{Z}^d$). Now we can state our main general result for
lattice systems.

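Theorem 2.1 Suppose $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. Suppose that $H^1, \ldots, H^k$ are
random set functions on $B_0$, each of which satisfies the stabilization conditions (2.4)
and (2.5), along with the moments condition (2.6) for some $\gamma > 2$. Let the $k \times k$
matrix $(\sigma_{ij})_{i,j=1}^k$ be given by

$\sigma_{ij} := \mathbb{E}[\mathbb{E}(\Delta^{H^i}_\infty \mid \mathcal{F}_{\mathbf{0}}) \, \mathbb{E}(\Delta^{H^j}_\infty \mid \mathcal{F}_{\mathbf{0}})]$. (2.8)

Then if $A_1, \ldots, A_k$ are Riemann measurable subsets of $B_0$, for $1 \le i \le j \le k$ we have

$\lim_{t \to \infty} t^{-d} \mathrm{Cov}(H^i_t(A_i), H^j_t(A_j)) = \sigma_{ij} |A_i \cap A_j|$, (2.9)

and as $t \to \infty$,

$(t^{-d/2}(H^i_t(A_i) - \mathbb{E}H^i_t(A_i)))_{i=1}^k \xrightarrow{\mathcal{D}} \mathcal{N}(0, (\sigma_{ij}|A_i \cap A_j|)_{i,j=1}^k)$. (2.10)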
In many examples, we consider only the case of a single random set function, that
is, the case where $H^1, \ldots, H^k$ are all the same random set function $H$.
In this case the result says that all the finite-dimensional joint distributions of
$(t^{-d/2}(H_t(A) - \mathbb{E}H_t(A)), A \in \mathcal{R}(B_0))$ converge to those of a centred Gaussian process
$(W(A), A \in \mathcal{R}(B_0))$ with covariance function

$\mathbb{E}[W(A)W(A')] = |A \cap A'| \, \mathbb{E}[(\mathbb{E}[\Delta^H_\infty \mid \mathcal{F}_{\mathbf{0}}])^2]$,

i.e., a white noise process.
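
As a quick consistency check on this covariance formula, consider two disjoint sets $A, A' \in \mathcal{R}(B_0)$. The computation below is a worked illustration added for clarity; the shorthand $\sigma^2 := \mathbb{E}[(\mathbb{E}[\Delta^H_\infty \mid \mathcal{F}_{\mathbf{0}}])^2]$ is introduced only for this example and is not notation from the text.

```latex
\begin{align*}
\mathrm{Cov}(W(A), W(A')) &= \sigma^2 |A \cap A'| = 0, \\
\mathrm{Var}(W(A)) &= \sigma^2 |A|, \\
\mathbb{E}\big[(W(A \cup A') - W(A) - W(A'))^2\big]
  &= \sigma^2 \big( |A \cup A'| + |A| + |A'| - 2|A| - 2|A'| + 2 \cdot 0 \big) = 0 .
\end{align*}
```

So the limiting field takes uncorrelated (hence, being jointly Gaussian, independent) values on disjoint sets and is almost surely additive over finite disjoint unions, with variance $\sigma^2$ per unit volume.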

2.3 Central limit theorems for continuum systems 

By a point process set function we mean a real-valued functional $h(\mathcal{X}, A)$ defined for
all $A \in \mathcal{R}(\mathbb{R}^d)$ and finite subsets $\mathcal{X}$ of $\mathbb{R}^d$, such that

(i) $(x_1, \ldots, x_k) \mapsto h(\{x_1, \ldots, x_k\}, A)$ is a Borel-measurable function, for all $k \in \mathbb{N}$;

(ii) for all $A \in \mathcal{R}(\mathbb{R}^d)$, $y \in \mathbb{R}^d$, and all finite $\mathcal{X} \subset \mathbb{R}^d$, $h$ satisfies the translation-invariance
condition

$h(\tau_y(\mathcal{X}), \tau_y(A)) = h(\mathcal{X}, A)$. (2.11)

For $\lambda > 0$, let $\mathcal{P}_\lambda$ denote a homogeneous Poisson point process in $\mathbb{R}^d$ of intensity
$\lambda$ (viewed as a random subset of $\mathbb{R}^d$). Given $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$, define the
point processes

$\mathcal{P}_{\lambda,t} := \mathcal{P}_\lambda \cap (tB_0), \quad t \ge 1$.

We derive a multivariate central limit theorem for $(h(\mathcal{P}_{\lambda,t}, tA), A \in \mathcal{R}(B_0))$ as $t \to \infty$.
The conditions on $h$ for our central limit theorem are defined in terms of the
"add one cost on $A$" defined by

$\delta(A, \mathcal{X}) := h(\mathcal{X} \cup \{\mathbf{0}\}, A) - h(\mathcal{X}, A)$. (2.12)
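
The add one cost (2.12) is straightforward to evaluate numerically for a given functional. The sketch below is illustrative only: the helper `add_one_cost`, the representation of the set $A$ as an indicator function, and the example functional `nn_length` (the sum over points in $A$ of the distance to the nearest other point) are assumptions made for this example and are not taken from the paper.

```python
import math


def add_one_cost(h, points, in_region, dim):
    """Compute h(X ∪ {0}, A) - h(X, A) for a finite point set X.

    `h(points, in_region)` is any point process set function; the set A is
    represented by its indicator `in_region(x) -> bool` (an assumption of this sketch).
    """
    origin = (0.0,) * dim
    base = set(points)
    return h(base | {origin}, in_region) - h(base, in_region)


def nn_length(points, in_region):
    """Hypothetical h: sum over x in X ∩ A of the distance from x to its nearest neighbour in X."""
    total = 0.0
    for x in points:
        if in_region(x):
            others = [math.dist(x, y) for y in points if y != x]
            total += min(others) if others else 0.0
    return total


X = {(1.0, 0.0), (4.0, 0.0)}
cost_everywhere = add_one_cost(nn_length, X, lambda x: True, dim=2)
cost_far_region = add_one_cost(nn_length, X, lambda x: x[0] > 2.0, dim=2)
```

In this small configuration, inserting the origin shortens the nearest-neighbour distance of the point at $(1, 0)$, so the cost on the whole space is nonzero, while the cost on a region away from the origin vanishes because the point at $(4, 0)$ keeps its nearest neighbour. This is the kind of locality that the stabilization conditions below formalize.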

We shall say the point process set function $h$ is strongly stabilizing at intensity $\lambda$ if
there exist almost surely finite random variables $\delta_\infty(\mathcal{P}_\lambda)$ (the stabilizing limit of $h$
at intensity $\lambda$) and $S$ (a radius of stabilization of $h$ at intensity $\lambda$) such that with
probability 1, $\mathcal{P}_\lambda$ is such that for $A \in \mathcal{R}(\mathbb{R}^d)$ and for all finite $\mathcal{A} \subset (\mathbb{R}^d \setminus B_S(\mathbf{0}))$,

$\delta(A, (\mathcal{P}_\lambda \cap B_S(\mathbf{0})) \cup \mathcal{A}) = \delta_\infty(\mathcal{P}_\lambda)$ if $A \supseteq B_S(\mathbf{0})$, (2.13)

and

$\delta(A, (\mathcal{P}_\lambda \cap B_S(\mathbf{0})) \cup \mathcal{A}) = 0$ if $A \cap B_S(\mathbf{0}) = \emptyset$. (2.14)

Thus, $S$ is a radius of stabilization if the add one cost on $A$ for the restriction of $\mathcal{P}_\lambda$
to a region containing the ball $B_S(\mathbf{0})$ is unaffected by changes in the configuration
outside the ball $B_S(\mathbf{0})$ if $B_S(\mathbf{0}) \subseteq A$ or $B_S(\mathbf{0}) \cap A = \emptyset$, taking the value $\delta_\infty(\mathcal{P}_\lambda)$ if
$B_S(\mathbf{0}) \subseteq A$ and the value zero if $B_S(\mathbf{0}) \cap A = \emptyset$. Our notion of strong stabilization
(2.13) is similar to that used in [18]. The second stabilization condition (2.14), like
its discrete counterpart (2.5), is new to this paper.

As in [18], as well as strong stabilization we have a notion of 'weak stabilization'
which we shall describe in Section 5. Loosely speaking, the distinction is that in
(2.13) and (2.14), the set $\mathcal{A}$ runs through all finite sets in $\mathbb{R}^d \setminus B_S(\mathbf{0})$, whereas the
corresponding weak stabilization conditions (eqns (5.1) and (5.2) below) refer only to
subsets of the underlying Poisson process $\mathcal{P}_\lambda$. Theorem 2.2 below is stated under the
strong stabilization conditions (2.13) and (2.14) but actually still holds if these are
replaced by the weak stabilization conditions (5.1) and (5.2). Theorems 2.3 and 2.4
really require the strong stabilization conditions (2.13) and (2.14). All the examples
discussed here satisfy (2.13) and (2.14) but there may be examples satisfying weak
but not strong stabilization, for example in relation to germ-grain (Boolean) models
[2] IB] EI] with no bound on grain sizes, or to the random connection model with
long-range connections ^3 E].

Let $\lambda > 0$ and let $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. Given $t \in [1, \infty)$ and $m \in \mathbb{N}$,
let $\mathcal{U}_{m,t}$ be a point process consisting of $m$ independent random $d$-vectors, each of
them uniformly distributed on $tB_0$. Also let $\mu_{\lambda,t}$ be the expected number of points
of $\mathcal{P}_{\lambda,t}$, i.e., let

$\mu_{\lambda,t} := \lambda t^d |B_0|$. (2.15)

We consider functionals $h$ satisfying the moments condition

$\sup_{t \ge 1,\, A \in \mathcal{R}(B_0),\, x \in -tB_0} \ \sup_{m \in [\mu_{\lambda,t}/2,\, 3\mu_{\lambda,t}/2]} \{\mathbb{E}[\delta(\tau_x(tA), \tau_x(\mathcal{U}_{m,t}))^4]\} < \infty$. (2.16)

In the sequel, it is likely that the fourth moments condition (2.16) can be replaced by
a $2 + \varepsilon$ moment condition, but this would not greatly expand the range of applications
known to the author.

We also require a mild uniform bound on $h$ in terms of the size of $\mathcal{X}$, whereby
there exists a constant $\beta_2$ such that for all finite sets $\mathcal{X} \subset \mathbb{R}^d$, and all $A \in \mathcal{R}(\mathbb{R}^d)$,

$|h(\mathcal{X}, A)| \le \beta_2 (\mathrm{diam}(\mathcal{X}) + \mathrm{card}(\mathcal{X}))^{\beta_2}$. (2.17)

We now give a multivariate CLT for $h(\mathcal{P}_{\lambda,t}, tA)$. This is the first of our continuum
analogues to Theorem 2.1.

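Theorem 2.2 Let $\lambda > 0$, and let $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. Suppose that
$h^1, \ldots, h^k$ are point process set functions which satisfy the stabilization conditions
(2.13), (2.14), the moments condition (2.16), and the uniform bound (2.17). Define
the $k \times k$ matrix $(\sigma^\lambda_{ij})_{i,j=1}^k$ by

$\sigma^\lambda_{ij} = \mathbb{E}[\mathbb{E}(\delta^i_\infty(\mathcal{P}_\lambda) \mid \mathcal{F}) \, \mathbb{E}(\delta^j_\infty(\mathcal{P}_\lambda) \mid \mathcal{F})]$, (2.18)

where $\mathcal{F}$ denotes the $\sigma$-field generated by the Poisson configuration in the half-space
$\{x = (x_1, \ldots, x_d) \in \mathbb{R}^d : x_1 < 0\}$, and $\delta^i_\infty(\mathcal{P}_\lambda)$ is the stabilizing limit of $h^i$. If
$A_1, \ldots, A_k$ are sets in $\mathcal{R}(B_0)$, then as $t \to \infty$,

$t^{-d} \mathrm{Cov}(h^i(\mathcal{P}_{\lambda,t}, tA_i), h^j(\mathcal{P}_{\lambda,t}, tA_j)) \to \lambda \sigma^\lambda_{ij} |A_i \cap A_j|$ (2.19)

and

$(t^{-d/2}(h^i(\mathcal{P}_{\lambda,t}, tA_i) - \mathbb{E}h^i(\mathcal{P}_{\lambda,t}, tA_i)))_{i=1}^k \xrightarrow{\mathcal{D}} \mathcal{N}(0, (\lambda \sigma^\lambda_{ij} |A_i \cap A_j|)_{i,j=1}^k)$. (2.20)

The next result is a de-Poissonized version of Theorem 2.2.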



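Theorem 2.3 Let $\lambda > 0$ and suppose $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. Suppose that
$h^1, \ldots, h^k$ are point process set functions, each $h^i$ satisfying the strong stabilization
conditions (2.13) (with stabilizing limit denoted $\delta^i_\infty(\mathcal{P}_\lambda)$) and (2.14), along with the
moments condition (2.16), and the uniform bound (2.17). Let the matrix $(\sigma^\lambda_{ij})_{i,j=1}^k$
be given by (2.18). Then if $A_1, \ldots, A_k$ are sets in $\mathcal{R}(B_0)$, if we define the matrix
$\tau^\lambda := (\tau^\lambda_{ij})_{i,j=1}^k$ by

$\tau^\lambda_{ij} := \frac{|A_i \cap A_j|}{|B_0|} \sigma^\lambda_{ij} - \frac{|A_i| \, |A_j|}{|B_0|^2} \mathbb{E}[\delta^i_\infty(\mathcal{P}_\lambda)] \, \mathbb{E}[\delta^j_\infty(\mathcal{P}_\lambda)]$, (2.21)

and if $(t_n)_{n \ge 1}$ is a $[1, \infty)$-valued sequence satisfying

$\limsup_{n \to \infty} n^{-1/2} |\lambda t_n^d |B_0| - n| < \infty$, (2.22)

then for $i, j \in \{1, \ldots, k\}$,

$\lim_{n \to \infty} n^{-1} \mathrm{Cov}(h^i(\mathcal{U}_{n,t_n}, t_n A_i), h^j(\mathcal{U}_{n,t_n}, t_n A_j)) = \tau^\lambda_{ij}$ (2.23)

and as $n \to \infty$,

$(n^{-1/2}(h^i(\mathcal{U}_{n,t_n}, t_n A_i) - \mathbb{E}h^i(\mathcal{U}_{n,t_n}, t_n A_i)))_{i=1}^k \xrightarrow{\mathcal{D}} \mathcal{N}(0, \tau^\lambda)$. (2.24)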
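
A simple sanity check on the Poisson and binomial limiting covariances is the counting functional $h(\mathcal{X}, A) = \mathrm{card}(\mathcal{X} \cap A)$, taken for every $h^i$. This example is added for illustration and does not appear in the text. It reads (2.21) as $\tau^\lambda_{ij} = \frac{|A_i \cap A_j|}{|B_0|}\sigma^\lambda_{ij} - \frac{|A_i||A_j|}{|B_0|^2}\mathbb{E}[\delta^i_\infty]\mathbb{E}[\delta^j_\infty]$, and it uses only the standard covariance formulas for Poisson and multinomial counts. Any fixed $S > 0$ serves as a radius of stabilization here.

```latex
\begin{align*}
\delta(A, \mathcal{X}) &= \mathbf{1}\{\mathbf{0} \in A\},
  \qquad \delta_\infty(\mathcal{P}_\lambda) = 1, \qquad \sigma^\lambda_{ij} = 1, \\
t^{-d}\,\mathrm{Cov}\big(h(\mathcal{P}_{\lambda,t}, tA_i),\, h(\mathcal{P}_{\lambda,t}, tA_j)\big)
  &= t^{-d}\,\lambda\,|tA_i \cap tA_j| = \lambda\,|A_i \cap A_j|, \\
n^{-1}\,\mathrm{Cov}\big(h(\mathcal{U}_{n,t_n}, t_nA_i),\, h(\mathcal{U}_{n,t_n}, t_nA_j)\big)
  &= \frac{|A_i \cap A_j|}{|B_0|} - \frac{|A_i|\,|A_j|}{|B_0|^2}.
\end{align*}
```

The second line matches $\lambda \sigma^\lambda_{ij}|A_i \cap A_j|$ from the Poisson theorem, and the third matches $\tau^\lambda_{ij}$ with $\sigma^\lambda_{ij} = 1$ and $\mathbb{E}[\delta_\infty] = 1$. The negative correction term is the familiar effect of fixing the total number of points.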



Given $\gamma \in \mathbb{R}$, we shall say $h$ is homogeneous of order $\gamma$ if

$h(a\mathcal{X}, aA) = a^\gamma h(\mathcal{X}, A), \quad \forall a > 0,\ A \in \mathcal{R}(\mathbb{R}^d),\ \text{finite } \mathcal{X} \subset \mathbb{R}^d$. (2.25)

If $h$ satisfies homogeneity, it is easy to deduce from the above theorems a multivariate
CLT, either for a homogeneous Poisson process of intensity $\lambda$ on $B_0$ as $\lambda \to \infty$,
or for a sample $\mathcal{U}_{n,1}$ of non-random size $n$ from the uniform distribution on $B_0$ as
$n \to \infty$. Here, we just state a result of the second type.
Theorem 2.4 Suppose $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. Set $\lambda_0 := |B_0|^{-1}$. Suppose
$h^1, \ldots, h^k$ are point process set functions, satisfying the strong stabilization conditions
(2.13), (2.14), the moments condition (2.16), the uniform bound (2.17), and
homogeneity of order $\gamma$ (2.25) for some $\gamma \in \mathbb{R}$. Suppose that $A_1, \ldots, A_k$ are sets in
$\mathcal{R}(B_0)$, and let $\tau^{\lambda_0} = (\tau^{\lambda_0}_{ij})_{i,j=1}^k$ be given by (2.21). Then for $i, j \in \{1, \ldots, k\}$,

$\lim_{n \to \infty} n^{(2\gamma/d) - 1} \mathrm{Cov}(h^i(\mathcal{U}_{n,1}, A_i), h^j(\mathcal{U}_{n,1}, A_j)) = \tau^{\lambda_0}_{ij}$ (2.26)

and as $n \to \infty$,

$(n^{(\gamma/d) - 1/2}(h^i(\mathcal{U}_{n,1}, A_i) - \mathbb{E}h^i(\mathcal{U}_{n,1}, A_i)))_{i=1}^k \xrightarrow{\mathcal{D}} \mathcal{N}(0, \tau^{\lambda_0})$. (2.27)



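Theorem 2.4 is easily proved by applying Theorem 2.3 with $t_n = n^{1/d}$, and using
homogeneity of $h^i$ to deduce that

$(n^{\gamma/d} h^i(\mathcal{U}_{n,1}, A_i))_{i=1}^k \stackrel{\mathcal{D}}{=} (h^i(\mathcal{U}_{n,n^{1/d}}, n^{1/d} A_i))_{i=1}^k$.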
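
The exponents in the homogeneous case can be traced step by step. The following derivation is a sketch added for illustration; it assumes only the scaling relation (2.25), the choices $\lambda_0 = |B_0|^{-1}$ and $t_n = n^{1/d}$, and the fact that $n^{1/d}\,\mathcal{U}_{n,1}$ has the same distribution as $\mathcal{U}_{n,n^{1/d}}$ (jointly over $i$).

```latex
\begin{align*}
\lambda_0\, t_n^d\, |B_0| &= |B_0|^{-1} \cdot n \cdot |B_0| = n
  \quad \text{(so the condition on $(t_n)$ in Theorem 2.3 holds trivially)}, \\
h^i\big(\mathcal{U}_{n,n^{1/d}},\, n^{1/d}A_i\big)
  &\stackrel{\mathcal{D}}{=} h^i\big(n^{1/d}\,\mathcal{U}_{n,1},\, n^{1/d}A_i\big)
  = n^{\gamma/d}\, h^i(\mathcal{U}_{n,1}, A_i), \\
n^{-1}\,\mathrm{Cov}\big(h^i(\mathcal{U}_{n,t_n}, t_nA_i),\, h^j(\mathcal{U}_{n,t_n}, t_nA_j)\big)
  &= n^{(2\gamma/d)-1}\,\mathrm{Cov}\big(h^i(\mathcal{U}_{n,1}, A_i),\, h^j(\mathcal{U}_{n,1}, A_j)\big).
\end{align*}
```

The same substitution in the normalized vector turns the factor $n^{-1/2}$ into $n^{(\gamma/d) - 1/2}$.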

Many applications are concerned with functionals of graphs of the form $G :=
G(\mathcal{X})$ defined for each locally finite point set $\mathcal{X} \subset \mathbb{R}^d$ (a locally finite subset of $\mathbb{R}^d$
is one with no limit point), where $G(\mathcal{X})$ has vertex set $\mathcal{X}$. See Sections 3.2 and 3.3
for examples.

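We shall say $G$ is translation invariant if translation by $y$ is a graph isomorphism
from $G(\mathcal{X})$ to $G(\tau_y(\mathcal{X}))$ for all $y \in \mathbb{R}^d$ and all locally finite point sets $\mathcal{X}$. We shall
say $G$ is scale invariant if $G(a\mathcal{X})$ is isomorphic to $G(\mathcal{X})$ for all $\mathcal{X}$ and all $a > 0$.

We use the following notion of stabilization for these graphs. Given $G$, and given
a vertex $x \in \mathcal{X}$, let $\mathcal{E}^+(x; \mathcal{X})$ be the set of edges of $G(\mathcal{X})$ which are not edges of
$G(\mathcal{X} \setminus \{x\})$, and let $\mathcal{E}^-(x; \mathcal{X})$ be the set of edges of $G(\mathcal{X} \setminus \{x\})$ which are not edges
of $G(\mathcal{X})$. Let $\mathcal{P}^{\mathbf{0}}_\lambda := \mathcal{P}_\lambda \cup \{\mathbf{0}\}$. Our stabilization condition for graphs is that there
exists an almost surely finite random variable $R$ such that

$\mathcal{E}^+(\mathbf{0}; \mathcal{P}^{\mathbf{0}}_\lambda) = \mathcal{E}^+(\mathbf{0}; (\mathcal{P}^{\mathbf{0}}_\lambda \cap B_R(\mathbf{0})) \cup \mathcal{A})$ (2.28)

and

$\mathcal{E}^-(\mathbf{0}; \mathcal{P}^{\mathbf{0}}_\lambda) = \mathcal{E}^-(\mathbf{0}; (\mathcal{P}^{\mathbf{0}}_\lambda \cap B_R(\mathbf{0})) \cup \mathcal{A})$ (2.29)

for all finite $\mathcal{A} \subset \mathbb{R}^d \setminus B_R(\mathbf{0})$.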

The stabilization conditions (2.28), (2.29) say that the local behavior of the
graph in a bounded region is unaffected by points beyond a finite (though possibly
random) distance from that region. As we shall see, the minimal spanning tree
(with the definition suitably extended from finite to locally finite point sets), the
$k$-nearest neighbours graph, and the sphere of influence graph all satisfy (2.28) and (2.29).

Another technical condition that turns out to be relevant to stabilization is
uniqueness of the infinite component for $G(\mathcal{P}_\lambda)$ and for $G(\mathcal{P}^{\mathbf{0}}_\lambda)$. For a locally finite
point set $\mathcal{X}$, we say that uniqueness of the infinite component holds for $G(\mathcal{X})$ if
there is almost surely at most a single infinite component of $G(\mathcal{X})$.

Given $G$, we consider three types of functional based on $G$. Firstly, we consider
the number of components of $G(\mathcal{X})$ with at least one vertex in $A$, which we denote
$K^G(\mathcal{X}, A)$.

Second, functionals such as total length of edges in $A$, number of edges in $A$,
or number of edges in $A$ of less than some specified length may be interpreted as a
total of $\phi$-weighted edge lengths in $A$, i.e., as a sum

$L^G_\phi(\mathcal{X}, A) := \frac{1}{2} \sum_{x \in \mathcal{X} \cap A} \ \sum_{e = \{x,y\} \in G(\mathcal{X})} \phi(|e|)$, (2.30)

for some appropriately specified function $\phi : (0, \infty) \to \mathbb{R}$.

Third, we consider functionals such as the number of vertices in $A$ of some
specified degree, or the number of components in $A$ with a specified number of
vertices, which are obtained by summing over all vertices in $A$ some function of the
local graph landscape of $G$ (not the edge lengths) at that vertex. To make this
precise, let $\mathcal{K}$ denote the set of unlabelled connected rooted graphs (i.e., connected
graphs with a single vertex distinguished and denoted the root). For $\kappa \in \mathbb{N}$, let $\mathcal{K}_\kappa$
denote the set of graphs in $\mathcal{K}$ which have all vertices a graph distance at most $\kappa$ from
the root (the graph distance between two vertices is the minimal number of edges
in a path between them, or infinity if no such path exists). For $\kappa \in \mathbb{N}$, let $B(\mathcal{K}_\kappa)$
denote the class of all bounded real-valued functions from $\mathcal{K}_\kappa$ to $\mathbb{R}$. For $\psi \in B(\mathcal{K}_\kappa)$
and for any vertex $x$ of any locally finite point set $\mathcal{X} \subset \mathbb{R}^d$, let $G_{x,\kappa}(\mathcal{X})$ denote the
rooted subgraph of $G(\mathcal{X})$ induced by all vertices a graph distance at most $\kappa$ from $x$
(with root at $x$), and let

$V^G_\psi(\mathcal{X}, A) := \sum_{x \in \mathcal{X} \cap A} \psi(G_{x,\kappa}(\mathcal{X}))$.

Lemma 2.1 Suppose $G$ is translation invariant and satisfies the stabilization conditions
(2.28) and (2.29). Then if we set $h(\mathcal{X}, A) = L^G_\phi(\mathcal{X}, A)$, the stabilization
conditions (2.13) and (2.14) hold. If instead, for some $\kappa \in \mathbb{N}$ and $\psi \in B(\mathcal{K}_\kappa)$, we
set $h(\mathcal{X}, A) = V^G_\psi(\mathcal{X}, A)$ then, again, conditions (2.13) and (2.14) hold.

Suppose in addition that uniqueness of the infinite component holds for $G(\mathcal{P}_\lambda)$
and for $G(\mathcal{P}^{\mathbf{0}}_\lambda)$; then if we set $h(\mathcal{X}, A) = K^G(\mathcal{X}, A)$, the stabilization conditions
(2.13) and (2.14) hold.

As we shall see in examples below, one can use Lemma 2.1 to check the applicability
of Theorems 2.2, 2.3 and 2.4 to a variety of point process functionals based on
stabilizing graphs.

2.4 Marked point processes 

In the application in Section 3.4 we need to consider the extension of the results
of the preceding section to functionals of marked point processes with marks in the
unit interval. A marked point set in $\mathbb{R}^d$ is a locally finite subset of $\mathbb{R}^d \times [0, 1]$ with
no two elements having the same coordinate projection onto $\mathbb{R}^d$.

If $\mathcal{X}^* = \{(x_i, t_i), i \ge 1\} \subset \mathbb{R}^d \times [0, 1]$ is a marked point set in $\mathbb{R}^d$, and $\mathcal{X} = \{x_i, i \ge 1\}$
is the corresponding unmarked set (i.e., the projection of $\mathcal{X}^*$ onto $\mathbb{R}^d$), then we
shall often abuse notation slightly and write $\mathcal{X}$ for $\mathcal{X}^*$, keeping in mind that each
element $x_i$ of $\mathcal{X}$ carries a mark $t_i$. Then all the notions and results of the previous
section carry through, as we now describe.

For $y \in \mathbb{R}^d$, the translation operator $\tau_y$ on marked point sets in $\mathbb{R}^d$ is to be
understood to preserve the values of all marks. Then the notion of a (marked) point
process set function $h(\mathcal{X}, A)$, defined for finite marked point sets $\mathcal{X}$ in $\mathbb{R}^d$ and for
$A \in \mathcal{R}(\mathbb{R}^d)$, is as given at the start of Section 2.3. Also, the notion of translation
invariance of a graph $G(\mathcal{X})$ is as defined in Section 2.3. When we consider edge
lengths and so on, the vertex set of $G(\mathcal{X})$ is still viewed as a subset of $\mathbb{R}^d$, not $\mathbb{R}^{d+1}$.
Also, it is to be understood that the scalar multiplication operator $\mathcal{X} \mapsto a\mathcal{X}$ on marked
point sets in $\mathbb{R}^d$, seen in the homogeneity condition (2.25) for example, leaves all
marks unchanged.

In the marked setting, the points of the $d$-dimensional point processes $\mathcal{P}_{\lambda,t}$ and
$\mathcal{U}_{m,t}$ are to be understood to carry marks which are each uniformly distributed
on $[0, 1]$ and independent. Also, the inserted point at $\mathbf{0}$, when defining add one
costs such as at (2.12), is assumed to carry an independent mark which is also
uniformly distributed on $[0, 1]$. The stabilization conditions (2.13) and (2.14) are to
be understood to hold for any choice of values for the marks of points in $\mathcal{A}$. Likewise
the uniform bound (2.17) is to be understood to hold for any choice of the marks
on $\mathcal{X}$.

With these interpretations, all of the results in Section 2.3 remain valid for
marked point set functionals on the marked point processes and stabilizing graphs
on the marked point processes.

3 Applications of the general results 
3.1 Percolation 

Let $E = \{0, 1\}$, let $\mathcal{E}$ be the power set of $E$ (i.e. the collection of all subsets of
$E$), with $P_0(\{1\}) = p$ and $P_0(\{0\}) = 1 - p$, $p \in (0, 1)$ a fixed parameter. Let
$X = (X_z)_{z \in \mathbb{Z}^d}$ and $X^{\mathbf{0}}$ (the same as $X$ but with $X_{\mathbf{0}}$ resampled) be as described in
Section 2.2 with this choice of $(E, \mathcal{E}, P_0)$. Let the sets $\mathcal{O}$, $\mathcal{O}'$ (the random sets of
'occupied sites' induced by $X$ and by $X^{\mathbf{0}}$ respectively) be given by

$\mathcal{O} := \{z \in \mathbb{Z}^d : X_z = 1\}, \quad \mathcal{O}' := \{z \in \mathbb{Z}^d : X^{\mathbf{0}}_z = 1\}$

(so that $\mathcal{O}' \triangle \mathcal{O}$ is either the empty set or the set $\{\mathbf{0}\}$).

For any subset $S$ of $\mathbb{Z}^d$, let $G(S)$ be the graph with vertex set $\{z \in S : X_z = 1\}$,
and with edges between each pair of vertices at unit Euclidean distance from each
other. Then $G(\mathcal{O})$ is a Bernoulli site percolation process with parameter $p$ (bond
percolation versions of the results in this section also hold, and are proved by similar
means taking E = {0, see [H], page 1517). 

For background information on percolation see Grimmett fjj. Let $p_c$ be the
critical value of $p$, i.e., the supremum of the set of $p$ for which the components of
$G(\mathcal{O})$ are a.s. all finite. Provided $d \ge 2$, it is known that $0 < p_c < 1$.

By the uniqueness of the infinite cluster in percolation (see, e.g., [7]), there is
almost surely at most a single infinite component of $G(\mathcal{O})$. For later use, we denote
the vertex set of this infinite component of $G(\mathcal{O})$ by $\mathcal{C}_\infty$ (possibly the empty set),
and denote the vertex set of the infinite component of $G(\mathcal{O}')$ by $\mathcal{C}'_\infty$.

Also for later use, observe that for $y, z \in \mathbb{Z}^d$,

$(\tau_y X)_z = 1 \iff y + z \in \mathcal{O} \iff z \in \tau_{-y}(\mathcal{O})$. (3.1)

We shall give two applications of Theorem 2.1 to percolation. Suppose $B_0 \in
\mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. The next result adds to previously known central limit
theorems for the total number of components in $tB_0$ (see [U HI El EI!), and says
that the numbers of components of $G(\mathcal{O} \cap tB_0)$ in disjoint subregions of a large region
$tB_0$ are asymptotically normal and asymptotically independent of each other. It is
of particular interest in the case when $p = p_c$ since in this case, correlations are not
believed to decay exponentially.

For percolation and also for some of the other spatial graphs that we consider,
there are several ways to count the 'number of components' in a subregion $A$ of $\mathbb{R}^d$,
since one has to decide whether to include components that lie only partially in $A$.
In results given here, such components are counted fully, but the same results should
hold if they were counted only partially, or not at all.

Theorem 3.1 Suppose $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. For $t \ge 1$, $A \in \mathcal{R}(B_0)$, let
$H_t(A)$ be the number of components of $G(\mathcal{O} \cap tB_0)$ which include at least one vertex
in $tA$.

Let $\Delta^H_\infty$ be the number of components of $G(\mathcal{O})$ that include at least one vertex at
or adjacent to the origin, minus the number of components of $G(\mathcal{O}')$ that include at
least one vertex at or adjacent to the origin.

Then the conditions (2.4), (2.5) and (2.6) for Theorem 2.1 are satisfied, and
therefore the conclusions (2.9) and (2.10) of that result are valid (with $H^i = H$ for
all $i$).
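
The functional in Theorem 3.1 and the effect of resampling one site can be explored on small configurations. The sketch below is an illustration only and is not taken from the paper: the function name `count_components` is hypothetical, and the occupied set, the region and the subregion are all passed explicitly as finite sets of lattice sites (standing in for $\mathcal{O}$, the lattice sites of $tB_0$ and those of $tA$).

```python
from collections import deque


def count_components(occupied, box, target):
    """Number of components of the nearest-neighbour graph on `occupied ∩ box`
    that contain at least one site of `target`.

    All three arguments are sets of integer coordinate tuples of equal dimension.
    """
    sites = occupied & box
    seen = set()
    count = 0
    for start in sites:
        if start in seen:
            continue
        # Breadth-first search over one component, noting whether it meets `target`.
        seen.add(start)
        queue = deque([start])
        hits_target = False
        while queue:
            z = queue.popleft()
            hits_target = hits_target or (z in target)
            for i in range(len(z)):
                for step in (-1, 1):
                    w = z[:i] + (z[i] + step,) + z[i + 1:]
                    if w in sites and w not in seen:
                        seen.add(w)
                        queue.append(w)
        count += hits_target
    return count


# A one-row strip: occupied sites at x = 0, 1, 3; the site x = 2 is vacant.
box = {(x, 0) for x in range(5)}
occupied = {(0, 0), (1, 0), (3, 0)}
right_part = {(3, 0), (4, 0)}
before = count_components(occupied, box, right_part)
after = count_components(occupied | {(2, 0)}, box, right_part)
```

Here occupying the vacant site merges two components, but the count of components meeting `right_part` is unchanged: the component through $x = 3$ was already counted. Over the whole strip the count drops from two to one.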

Remark. Following the approach of Cox and Grimmett [4] to the central limit
theorem for the number of components in $tB_0$, one could generalize Theorem 3.1
by taking $H_t(A)$ to be of the form $\sum_{C \in \mathcal{C}(t,A)} \psi(C)$. Here $\mathcal{C}(t, A)$ denotes the set of
$C \subset \mathbb{Z}^d$ such that $C$ is the vertex set of a component of $G(\mathcal{O} \cap tB_0)$ which has at
least one vertex in $tA$, and $\psi$ is some function defined on finite $S \subset \mathbb{Z}^d$ such that
$G(S)$ is connected (in Theorem 3.1 we consider the special case where $\psi$ is identically
1). In this more general setting, by a modification of the proof of Theorem 3.1 one
can still check the conditions (2.4), (2.5), and (2.6), and hence apply Theorem 2.1,
provided $\psi$ satisfies the following conditions:

1. $\psi$ is translation-invariant, i.e. $\psi(\tau_y(S)) = \psi(S)$ for all $y \in \mathbb{Z}^d$ and all $S \subset \mathbb{Z}^d$
such that $G(S)$ is connected.

2. $\psi(S)$ converges to a finite limit as $|S| \to \infty$.

The above conditions imply that $\psi$ is bounded. Unlike in [4], we do not require $\psi$
to be monotone here and we can take any $p \in (0, 1)$, including $p = p_c$. On the other
hand, the corresponding set of conditions on $\psi$ in [4] does not include translation-invariance.

Proof of Theorem 3.1. Let $t \ge 1$, $y \in \mathbb{Z}^d$, and $A \in \mathcal{R}(B_0)$. By (3.1), $H_{t,y}(A)$ is the
number of components of $G((\tau_{-y}\mathcal{O}) \cap tB_0)$ which intersect $tA$ (i.e., contain at least
one vertex in $tA$). Hence, $H_{t,y}(A)$ is the number of components of $G(\mathcal{O} \cap \tau_y(tB_0))$
which intersect $\tau_y(tA)$.

Thus $-\Delta^H_{t,y}(A)$ is the increment in the number of components of $G(\mathcal{O} \cap
\tau_y(tB_0))$ which intersect $\tau_y(tA)$ when we resample $X_{\mathbf{0}}$ (i.e., when we replace the
process $X$ by $X^{\mathbf{0}}$).

With $\Delta^H_\infty$ defined in the statement of the theorem, we assert that (2.4) and (2.5)
hold. To verify (2.4), suppose that $\liminf_{n \to \infty}(\tau_{y_n}(t_n A)) = \mathbb{R}^d$. Suppose first that
$X_{\mathbf{0}} = 0$ and $X_* = 1$. Then there exists a (random) $N_1$ such that for $n > N_1$, every
pair of vertices lying adjacent to the origin and in the same component of $G(\mathcal{O})$ is
connected by a path in $G(\mathcal{O})$, all of whose vertices lie in $\tau_{y_n}(t_n A)$. Then for all
$n > N_1$, $\Delta^H_{t_n,y_n}(A) = \Delta^H_\infty$ as described above. A similar argument applies in the case
with $X_{\mathbf{0}} = 1$ and $X_* = 0$, and for other cases clearly $\Delta^H_{t_n,y_n}(A) = 0$ for all $n$. Thus
(2.4) holds.

Next, suppose $\liminf(\tau_{y_n}(t_n(B_0 \setminus A))) = \mathbb{R}^d$. Suppose $X_{\mathbf{0}} = 0$. There exists a
random $N_2$ such that for all $n > N_2$, the set $\tau_{y_n}(t_n(B_0 \setminus A))$ contains all
finite components of $G(\mathcal{O})$ lying adjacent to the origin.

There exists a random $N_3$ such that for all $n > N_3$, each pair of vertices of $\mathcal{C}_\infty$
which lie adjacent to the origin is connected by a path in $G(\mathcal{O})$ all of whose vertices
lie in the set $\tau_{y_n}(t_n(B_0 \setminus A))$. We assert that if $n > \max(N_2, N_3)$, changing the
value of $X_{\mathbf{0}}$ from 0 to 1 does not affect the number of components of $G(\mathcal{O} \cap \tau_{y_n}(t_n B_0))$
that intersect $\tau_{y_n}(t_n A)$. This is because for $n$ this big, any two occupied vertices
adjacent to the origin which are both connected by paths in $G(\mathcal{O})$ to vertices in
$\tau_{y_n}(t_n A)$ must be part of $\mathcal{C}_\infty$, and therefore are connected by a path which avoids
the origin, so that they are already part of the same component of $G(\mathcal{O} \cap \tau_{y_n}(t_n B_0))$
even before we add a vertex at the origin to $\mathcal{O}$. In other words, $G(\mathcal{O} \cap \tau_{y_n}(t_n B_0))$ has
at most a single component which intersects both the set $\tau_{y_n}(t_n A)$ and the set
of sites adjacent to the origin; see Figure 1. The assertion follows, and one argues
similarly for $X_* = 0$. Thus (2.5) holds.

Figure 1: The rectangle represents the region $\tau_{y_n}(t_n B_0)$, and the region to the right of the jagged line is $\tau_{y_n}(t_n A)$. The other rectilinear lines indicate the components of $G(\mathcal{O})$ adjacent to the origin.

Since all vertices in $G(\mathbb{Z}^d)$ have degree $2d$, the absolute value of $\Delta_{t,y}(A)$ is uniformly bounded by $2d - 1$, and therefore the moments condition (2.6) is valid for any finite $\gamma$. Therefore Theorem 2.1 is applicable here. ■
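The uniform bound $2d - 1$ on the effect of resampling the origin can be explored numerically. The following is only an illustrative sketch in dimension $d = 2$: the box size, the occupation probability, the choice of the half-box $\{x_1 \ge 1\}$ as a stand-in for $tA$, and all function names are assumptions made for this example and do not come from the paper.

```python
import random
from collections import deque

def components_hitting(occupied, region):
    """Count nearest-neighbour components of `occupied` containing a site of `region`."""
    seen, count = set(), 0
    for start in occupied:
        if start in seen:
            continue
        seen.add(start)
        hits, queue = start in region, deque([start])
        while queue:  # breadth-first search over one component
            x, y = queue.popleft()
            for nb in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
                if nb in occupied and nb not in seen:
                    seen.add(nb)
                    hits = hits or nb in region
                    queue.append(nb)
        count += hits
    return count

def max_resampling_change(n=8, p=0.5, trials=200, seed=0):
    """Largest observed |change| in the count when the origin is toggled."""
    rng = random.Random(seed)
    box = [(x, y) for x in range(-n, n + 1) for y in range(-n, n + 1)]
    region = {(x, y) for (x, y) in box if x >= 1}  # hypothetical stand-in for tA
    worst = 0
    for _ in range(trials):
        occupied = {s for s in box if rng.random() < p}
        with_origin = components_hitting(occupied | {(0, 0)}, region)
        without_origin = components_hitting(occupied - {(0, 0)}, region)
        worst = max(worst, abs(with_origin - without_origin))
    return worst

print(max_resampling_change())  # should never exceed 2d - 1 = 3 when d = 2
```

Occupying the origin can merge at most $2d$ neighbouring components into one, which is why the observed change cannot exceed $2d - 1$ regardless of the random seed.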

We now consider the largest component of $G(\mathcal{O} \cap tB_0)$, adding to the central limit theorem for the largest component size given in [Tlj. A largest component of $G(\mathcal{O} \cap tB_0)$ is a component such that no other component has more vertices. There could be more than one largest component; in the sequel, the "vertices which lie in a largest component" means the vertices lying in the union of all largest components as defined above, while any discussion of properties of "the largest component" refers to the case where there is a unique largest component. The following result says that the distribution of the vertices lying in a largest component in $tB_0$ is asymptotically a white noise distribution. In it, we assume $B_0$ is rectangular, i.e., that $B_0$ is a product of bounded intervals. Presumably, the proof can be extended to other shapes of $B_0$.

Theorem 3.2 Suppose that $p > p_c$, and that $B_0$ is rectangular. For $t \ge 1$ and $A \in \mathcal{R}(B_0)$, let $H_t(A)$ be the number of vertices of $tA$ which lie in a largest component of $G(\mathcal{O} \cap tB_0)$. Set

\[ \Delta_\infty = \begin{cases} \mathrm{card}(C_\infty \setminus C'_\infty) & \text{if } X_0 = 1,\ X_* = 0, \\ -\mathrm{card}(C'_\infty \setminus C_\infty) & \text{if } X_0 = 0,\ X_* = 1, \\ 0 & \text{if } X_0 = X_*. \end{cases} \]

Then the conditions (2.4), (2.5) and (2.6) for Theorem 2.1 are satisfied, and therefore the conclusions of that result are valid (with all $H^i = H$).

Proof. Observe first that $\Delta_\infty$ is indeed almost surely finite. For example, if $X_0 = 1$ and $X_* = 0$, then $C'_\infty \subset C_\infty$ and $C_\infty \setminus C'_\infty$ consists (in the case where $0 \in C_\infty$) of those finite components of $G(\mathcal{O}')$ lying adjacent to the origin, along with the origin itself (with $C_\infty \setminus C'_\infty = \emptyset$ in the case where $0 \notin C_\infty$).

Observe also that for any $t \ge 1$, $y \in \mathbb{Z}^d$ and $A \in \mathcal{R}(B_0)$, by definition $H_{t,y}(A)$ is the number of vertices in $tA$ in a largest component of $G(\tau_{-y}(\mathcal{O}) \cap tB_0)$, and so is the number of vertices in $\tau_y(tA)$ in a largest component of $G(\mathcal{O} \cap \tau_y(tB_0))$.

In what follows, a few plausible (and actually true) facts are stated without 
proof. For details of their proofs, see [H] . 

Suppose that $\liminf_{n \to \infty}(\tau_{y_n}(t_n B_0)) = \mathbb{R}^d$. Then with probability tending to 1, the largest component of $G(\mathcal{O} \cap \tau_{y_n}(t_n B_0))$ is unique, and is the largest component of $G(C_\infty \cap \tau_{y_n}(t_n B_0))$, and if $0 \in C_\infty$ then the largest component of $G(\mathcal{O} \cap \tau_{y_n}(t_n B_0))$ is the component of $G(C_\infty \cap \tau_{y_n}(t_n B_0))$ containing the origin.

Suppose that $\liminf_{n \to \infty}(\tau_{y_n}(t_n A)) = \mathbb{R}^d$. Then with probability 1, the set $C_\infty \triangle C'_\infty$ is contained in $\tau_{y_n}(t_n A)$ for all large enough $n$. Hence, the probability that $\Delta_{t_n,y_n}(A)$ is equal to $\Delta_\infty$ defined above tends to 1, and so (2.4) holds.

Suppose that $\liminf_{n \to \infty}(\tau_{y_n}(t_n(B_0 \setminus A))) = \mathbb{R}^d$. Then with probability 1, the set $C_\infty \triangle C'_\infty$ is contained in $\tau_{y_n}(t_n(B_0 \setminus A))$ for all large enough $n$. If $C_\infty \triangle C'_\infty \subset \tau_{y_n}(t_n(B_0 \setminus A))$, and also the largest component of $G(\mathcal{O} \cap \tau_{y_n}(t_n B_0))$ is the sole component of $G(C_\infty \cap \tau_{y_n}(t_n B_0))$ containing a vertex adjacent to the origin, and also the largest component of $G(\mathcal{O}' \cap \tau_{y_n}(t_n B_0))$ is the sole component of $G(C'_\infty \cap \tau_{y_n}(t_n B_0))$ containing a vertex adjacent to the origin, then changing the value of $X_0$ from 1 to 0 will not remove any vertices of the largest component lying in $\tau_{y_n}(t_n A)$, so that it does not change the value of $H_{t_n,y_n}(A)$. Hence the probability that $\Delta_{t_n,y_n}(A)$ is equal to zero tends to 1, and so (2.5) holds.

We need to check the moments condition (2.6). Most of the ingredients in the proof of this are given in the proof of Theorem 3.2 of [Hj. The main difference is that we now need to account for a possible decrease in the number of elements in $\tau_y(tA)$ of a largest component when we change the status of site 0 from 'vacant' to 'occupied' (there we needed only to consider the largest component size, which by contrast really is monotone in $X_0$). Such a decrease could happen either if $G(\mathcal{O} \cap \tau_y(tB_0) \setminus \{0\})$ has more than one largest component, or if two or more components of $G(\mathcal{O} \cap \tau_y(tB_0) \setminus \{0\})$ lying adjacent to 0, when merged, form a component larger than and disjoint from the previous largest component. However, the probability of either of these possibilities occurring decays exponentially in $t$ (see, e.g., Theorem 4 in Penrose and Pisztora PHI), and using this we can check (2.6) here. ■

3.2 The minimal spanning tree 

Given a locally finite set $\mathcal{X} \subset \mathbb{R}^d$, $d \ge 2$, and given $a > 0$, let $\mathcal{G}_a(\mathcal{X})$ be the graph with vertex set $\mathcal{X}$ and with edge set $\{\{x, y\} : |x - y| < a\}$. Let MST($\mathcal{X}$) be the graph with vertex set $\mathcal{X}$ obtained by including each edge $\{x, y\}$ such that $x$ and $y$ lie in different components of $\mathcal{G}_{|x-y|}(\mathcal{X})$ and at least one of these components is finite. If $\mathcal{X}$ is finite with distinct inter-point distances, then MST($\mathcal{X}$) is the minimal spanning tree on $\mathcal{X}$, i.e. the connected graph with vertex set $\mathcal{X}$ of minimal total edge length; see Aldous and Steele ([1], Lemma 12). Clearly MST($\mathcal{X}$) is translation and scale invariant.
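For a finite point set, the threshold-graph definition above can be compared directly with a standard minimal spanning tree algorithm. The sketch below is a brute-force illustration only: the function names, the sample size and the use of Kruskal's algorithm as the reference are choices made for this example, not part of the paper.

```python
import itertools
import math
import random

def find(parent, i):
    while parent[i] != i:
        parent[i] = parent[parent[i]]  # path halving
        i = parent[i]
    return i

def mst_by_definition(points):
    """Edges (i, j) whose endpoints lie in different components of the graph
    joining all pairs at distance strictly less than |x_i - x_j|."""
    n, edges = len(points), set()
    pairs = list(itertools.combinations(range(n), 2))
    for i, j in pairs:
        a = math.dist(points[i], points[j])
        parent = list(range(n))
        for u, v in pairs:
            if math.dist(points[u], points[v]) < a:
                parent[find(parent, u)] = find(parent, v)
        if find(parent, i) != find(parent, j):
            edges.add((i, j))
    return edges

def mst_kruskal(points):
    """Reference implementation: Kruskal's algorithm with union-find."""
    n, edges = len(points), set()
    parent = list(range(n))
    pairs = sorted(itertools.combinations(range(n), 2),
                   key=lambda e: math.dist(points[e[0]], points[e[1]]))
    for i, j in pairs:
        ri, rj = find(parent, i), find(parent, j)
        if ri != rj:
            parent[ri] = rj
            edges.add((i, j))
    return edges

rng = random.Random(1)
pts = [(rng.random(), rng.random()) for _ in range(12)]
scaled = [(2.0 * x, 2.0 * y) for x, y in pts]  # rescaled copy of the same configuration
print(mst_by_definition(pts) == mst_kruskal(pts))
print(mst_kruskal(scaled) == mst_kruskal(pts))
```

The second comparison illustrates scale invariance: rescaling the points changes edge lengths but not which pairs are joined.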

Recall the definitions of $V^G_\psi$, $L^G_\phi$, and $B(\mathcal{K}_k)$ from Section 2.3. The first part of the following result tells us that the totals of a local graph landscape function (for example, the numbers of leaves), summed over points of the random minimal spanning tree in disjoint regions, scaled and centred, are asymptotically independent normals. The second part says that the totals of $\phi$-weighted edges of the random minimal spanning tree in disjoint regions, scaled and centred, are asymptotically independent normals. In this result, say $\phi$ is polynomially bounded if there exists a constant $c$ such that $|\phi(r)| \le c(1 + r)^c$ for all $r \ge 0$.

Theorem 3.3 Suppose $G(\mathcal{X})$ is MST($\mathcal{X}$). Let $\lambda > 0$, and suppose $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$.

Let $k \in \mathbb{N}$, and suppose $\psi \in B(\mathcal{K}_k)$. If we set $h(\mathcal{X}, A) = V^G_\psi(\mathcal{X}, A)$, then $h$ satisfies all the conditions (2.13), (2.14), (2.16), (2.17), and (2.25) (with $\gamma = 0$) of Theorems 2.2, 2.3 and 2.4, and therefore satisfies their conclusions (with $h^j(\mathcal{X}, A) = V^G_\psi(\mathcal{X}, A)$ for all $j$ and with $\gamma = 0$).

Suppose instead that we set $h(\mathcal{X}, A) = L^G_\phi(\mathcal{X}, A)$ for some $\phi : (0, \infty) \to \mathbb{R}$. Then the stabilization conditions (2.13) and (2.14) hold. If $\phi$ is bounded, or if $\phi$ is polynomially bounded and $B_0$ is convex, then the moments condition (2.16) holds and so Theorems 2.2 and 2.3 apply and their conclusions (with $h^j = h$ for all $j$) hold.

If also $\phi(r) = r^\alpha$ for some constant $\alpha$, then the homogeneity hypothesis (2.25) holds and hence the conclusions (2.26) and (2.27) of Theorem 2.4 are valid with $\gamma = \alpha$.

The proof of this uses the following lemma which we shall use again later.

Lemma 3.1 If $B_0 \subset \mathbb{R}^d$ is bounded and convex with $|B_0| > 0$, then

\[ \inf_{x \in B_0,\ r \in (0,1]} r^{-d} |B_r(x) \cap B_0| > 0. \tag{3.2} \]

Proof. The assumptions on $B_0$ imply that $B_0$ has non-empty interior, so that there exists a ball $B$ contained in $B_0$. For any $x \in B_0$, the convex hull of $\{x\} \cup B$ is contained in $B_0$, and since $B_0$ is bounded the angle subtended by this cone-like set at $x$ is bounded away from zero, so the result follows. ■

Proof of Theorem 3.3. Condition (2.28) follows from Lemma 2.1 of [19]. Condition (2.29) is more complicated but follows from the proof of Proposition 1 of Lee jllj. Therefore, Lemma 2.1 of the present paper can be applied to give us the conditions (2.13), (2.14) in the case where either $h(\mathcal{X}, A) = V^G_\psi(\mathcal{X}, A)$ or $h(\mathcal{X}, A) = L^G_\phi(\mathcal{X}, A)$.

Given a finite set $\mathcal{X} \subset \mathbb{R}^d$, consider the effect on the minimal spanning tree MST($\mathcal{X}$) of adding a point at the origin 0. Let edges of MST($\mathcal{X} \cup \{0\}$) that are not in MST($\mathcal{X}$) be denoted added edges, and let edges of MST($\mathcal{X}$) that are not in MST($\mathcal{X} \cup \{0\}$) be denoted deleted edges.

By the revised add and delete algorithm of Lee JTQJ, the added edges are precisely those incident to 0 in MST($\mathcal{X} \cup \{0\}$), and there are fewer deleted edges than added edges. Moreover, there is a uniform non-random bound on vertex degrees in the minimal spanning tree (see pQ), and hence there is a uniform bound both on the number of added edges and on the number of deleted edges. The moments condition (2.16) for $h(\mathcal{X}, A) = V^G_\psi(\mathcal{X}, A)$ is immediate from these remarks. Moreover, if $\phi$ is bounded, then (2.16) for $h(\mathcal{X}, A) = L^G_\phi(\mathcal{X}, A)$ also follows from these remarks.

Suppose that $\phi$ is polynomially bounded and $B_0$ is convex. By the preceding remarks, to prove (2.16) in this case, it suffices to show that for any $K > 0$, there is a deterministic uniform bound on the $K$th moment of the length of the longest added edge when a point at 0 is inserted into $\mathcal{U}_{m,t}$ with $t \ge 1$, $0 \in \tau_x(tB_0)$ and $m/(\lambda t^d |B_0|)$ in the range $[1/2, 3/2]$, and likewise for the longest deleted edge.

We assert that the longest deleted edge in a finite set $\mathcal{X}$ is at most twice as long as the longest added edge. To see this, suppose that $\{X, Y\}$ is a deleted edge. Then, since all added edges are incident to the added point at 0, and there must be a path from $X$ to $Y$ in MST($\mathcal{X} \cup \{0\}$), there exist points $X', Y'$ in $\mathcal{X}$ such that $X', Y'$ are both adjacent to 0 in MST($\mathcal{X} \cup \{0\}$), and such that there is a path in MST($\mathcal{X}$) from $X$ to $X'$, and a path in MST($\mathcal{X}$) from $Y$ to $Y'$. By the triangle inequality, $|X' - Y'|$ is at most twice the length of the longest added edge, and also $|X - Y| \le |X' - Y'|$ since otherwise we could start with MST($\mathcal{X}$), then replace edge $\{X, Y\}$ by $\{X', Y'\}$ to obtain a spanning tree on $\mathcal{X}$ of smaller total length, a contradiction. This completes the proof of the assertion.
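The statements about added and deleted edges are deterministic, so they can be checked on any finite configuration with distinct inter-point distances. The sketch below uses Kruskal's algorithm on random planar points; the sample sizes and helper names (`mst`, `length`, `insertion_report`) are illustrative assumptions, not notation from the paper.

```python
import itertools
import math
import random

def mst(points):
    """Kruskal's algorithm; returns the edges as frozensets of two points."""
    parent = {p: p for p in points}

    def find(p):
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    edges = set()
    for p, q in sorted(itertools.combinations(points, 2), key=lambda e: math.dist(*e)):
        rp, rq = find(p), find(q)
        if rp != rq:
            parent[rp] = rq
            edges.add(frozenset((p, q)))
    return edges

def length(edge):
    p, q = tuple(edge)
    return math.dist(p, q)

def insertion_report(points, origin=(0.0, 0.0)):
    """Added and deleted edges when `origin` is inserted into `points`."""
    before, after = mst(points), mst(points + [origin])
    return after - before, before - after

rng = random.Random(0)
pts = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(25)]
added, deleted = insertion_report(pts)
print(all((0.0, 0.0) in e for e in added))   # added edges are incident to 0
print(len(deleted) == len(added) - 1)        # fewer deleted than added edges
if deleted:
    print(max(map(length, deleted)) <= 2 * max(map(length, added)))
```

The count `len(deleted) == len(added) - 1` reflects that a spanning tree on one more vertex has exactly one more edge.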

Thus, to check (2.16) for $h(\mathcal{X}, A) = L^G_\phi(\mathcal{X}, A)$ when $\phi$ is polynomially bounded and $B_0$ is convex, it suffices to prove the $K$th moment of the longest added edge is uniformly bounded. This can be proved by an argument along the lines of Lemma 2.1 of Yukich (using convexity of $B_0$, and (3.2)).

The uniform bound (2.17) is trivial for $V^G_\psi$ and also holds for $L^G_\phi$ since $\phi$ is assumed polynomially bounded. The homogeneity condition (2.25), for $V^G_\psi$ or for $L^G_\phi$ with $\phi(r) = r^\alpha$, follows from the fact that the graph MST($\mathcal{X}$) is scale invariant. ■

Remark. In the univariate case Kesten and Lee ([0], Theorem 2) give a CLT for the total of $\phi$-weighted edge lengths when $\phi$ is monotonically increasing. Our result gives central limit theorems, for this univariate case, without this restriction on $\phi$.

Remark. Consider the empirical distribution of MST edge lengths. That is, let $N_s(\mathcal{X})$ be the number of edges of MST($\mathcal{X}$) of length less than $s$. For arbitrary $B_0$, the finite-dimensional distributions of the scaled, centred empirical process

\[ \left(t^{-d/2}\,(N_s(\mathcal{P}_{\lambda,t}) - E N_s(\mathcal{P}_{\lambda,t})),\ s \ge 0\right) \tag{3.3} \]

converge as $t \to \infty$ to those of a Gaussian process. Moreover, the finite-dimensional distributions of the corresponding scaled, centred empirical process for a binomial sample, namely

\[ \left(n^{-1/2}\,(N_{n^{-1/d}s}(\mathcal{U}_{n,1}) - E N_{n^{-1/d}s}(\mathcal{U}_{n,1})),\ s \ge 0\right) \tag{3.4} \]

converge as $n \to \infty$ to those of another Gaussian process.

In the case of the first empirical process (3.3), this follows by taking an arbitrary set of positive 'times' $s_1, \ldots, s_k$ and applying Theorem 2.2 with $h^i(\mathcal{X}, A) = L^G_{\phi_i}(\mathcal{X}, A)$ and $\phi_i(r) = \mathbf{1}\{r < s_i\}$ for $1 \le i \le k$. The limiting Gaussian process in this case has covariance function given by the function $E[Y_s Y_{s'}]$, $s, s' \ge 0$, where we set

\[ Y_s = E[\delta_s(\infty) \mid \mathcal{F}], \tag{3.5} \]

with $\delta_s(\infty)$ denoting the stabilizing limit for the functional $h(\mathcal{X}, A) = L^G_{\phi_s}(\mathcal{X}, A)$ with $\phi_s(r) := \mathbf{1}\{r < s\}$, and with $\mathcal{F}$ denoting the $\sigma$-field generated by the Poisson configuration in the half-space $\{x = (x_1, \ldots, x_d) \in \mathbb{R}^d : x_1 < 0\}$.

In the case of the second empirical process (3.4), the convergence of finite-dimensional distributions follows similarly but this time using Theorem 2.3 and the observation that $(N_s(\mathcal{U}_{n,n^{1/d}}),\ s \ge 0)$ has the same distribution as $(N_{n^{-1/d}s}(\mathcal{U}_{n,1}),\ s \ge 0)$. The limiting covariance function is this time given by $\mathrm{Cov}(Y_s, Y_{s'})$, $s, s' \ge 0$, with $Y_s$ given once more by (3.5), but now with $\lambda = |B_0|^{-1}$.



3.3 Nearest-neighbour type graphs 

Let $k \in \mathbb{N}$. The $k$-nearest neighbour graph ($k$-NNG) on a locally finite set $\mathcal{X} \subset \mathbb{R}^d$ is obtained by including an undirected edge connecting each vertex $x \in \mathcal{X}$ to each of its $k$ nearest neighbours (using the lexicographic ordering as a tie-breaker in the event of a tie). We also consider the sphere of influence graph (SIG), in which, denoting the distance from $x \in \mathcal{X}$ to its nearest neighbour by $R_x$, we connect vertices $x, y$ of $\mathcal{X}$ by an edge if and only if $B_{R_x}(x) \cap B_{R_y}(y) \ne \emptyset$.

White noise limits for functionals such as $L^G_\phi$ and $V^G_\psi$ (defined in Section 2.3) can be derived using either the results in this paper or by other methods based on exponential decay, as in [2]. We concentrate here on the component count $K^G(\mathcal{X}, A)$, for which exponential decay is not so clear.

Theorem 3.4 Suppose $G(\mathcal{X})$ is defined to be the $k$-NNG on $\mathcal{X}$. Let $\lambda > 0$ and let $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. Then $h(\mathcal{X}, A) = K^G(\mathcal{X}, A)$ satisfies all the conditions (2.13), (2.14), (2.16), (2.17), and (2.25) (with $\gamma = 0$) for Theorems 2.2, 2.3 and 2.4. Therefore, $h(\mathcal{X}, A) = K^G(\mathcal{X}, A)$ satisfies the conclusions in those results (with $h^i = h$ for all $i$, and with $\gamma = 0$).



Proof. There is a deterministic uniform upper bound on the degree of vertices in $G(\mathcal{X})$ (see, e.g., Lemma 8.4 of Yukich [2H]), and hence a uniform deterministic bound on the change in the number of components of $G(\mathcal{X})$ caused by inserting a single point; the moments condition (2.16) follows.

We can obtain the stabilization conditions (2.13) and (2.14) by using Lemma 2.1. This result is applicable because conditions (2.28) and (2.29) hold by the proof of Lemma 6.1 of [18], while uniqueness of the infinite component in $G(\mathcal{P})$ and $G(\mathcal{P}^0)$ holds by Lemma 6.4 of [18]. The uniform bound (2.17) and the homogeneity (2.25) are obvious. ■

Theorem 3.5 Let $\lambda > 0$, and suppose $B_0 \subset \mathbb{R}^d$ is convex and bounded with $|B_0| > 0$. Suppose $G(\mathcal{X})$ is defined to be the SIG on $\mathcal{X}$. Then $h(\mathcal{X}, A) = K^G(\mathcal{X}, A)$ satisfies all the conditions (2.13), (2.14), (2.16), (2.17), and (2.25) (with $\gamma = 0$) for Theorems 2.2, 2.3 and 2.4. Therefore, $h(\mathcal{X}, A) = K^G(\mathcal{X}, A)$ satisfies the conclusions in those results (with $h^i = h$ for all $i$, and with $\gamma = 0$).

Proof. We need to check the moments condition (2.16). This can be done as in the proof of Theorem 7.4 of [18]. The 'regularity' condition in that result is implied by the condition (3.2) here; see the remarks on page 1010 of [18].

The rest of the argument is similar to that for Theorem 3.4. For conditions (2.28) and (2.29), see the proof of Lemma 7.1 of [18]. For uniqueness of the infinite component of $G(\mathcal{P})$, see Theorem 7.3 of [18]. The infinite component of $G(\mathcal{P}^0)$ is also unique, by the proof of Lemma 6.4 (b) of [18]. ■

3.4 The on-line nearest neighbour graph 

Suppose $X_1, X_2, \ldots, X_n$ are points in $\mathbb{R}^d$. In the on-line nearest neighbour graph (or on-line NNG for short), the points are assumed to arrive sequentially and each point $X_i$, $i \ge 2$, is connected by an undirected edge to its nearest neighbour in the set of preceding points in the sequence $\{X_1, \ldots, X_{i-1}\}$ (using the lexicographic ordering on $\mathbb{R}^d$ to break any ties). The resulting graph is a tree, which we will denote the on-line nearest neighbour graph on the sequence of points $(X_1, X_2, \ldots, X_n)$. One could also consider the on-line $k$-nearest neighbour graph defined analogously, with each point $X_i$ connected to its $k$ nearest neighbours in $\{X_1, \ldots, X_{i-1}\}$ if $i > k$, and connected to each of $X_1, \ldots, X_{i-1}$ if $i \le k$. In our case, the points in the sequence will be random, independent and uniformly distributed over $B_0$ or over $tB_0$.

The on-line nearest neighbour graph on random points is a natural growth model for spatial random graphs, although it was apparently introduced only recently, by Berger et al. ([3], Section 3). There, the motivation comes from the search for a simple model of scale-free networks, the graph being itself a simplification of a model of Fabrikant et al. [5].

To fit this graph into our present setup, consider a marked random finite point set $\mathcal{X}$ in $\mathbb{R}^d$, where each point $X$ of $\mathcal{X}$ carries a random mark $T_X$ which is uniformly distributed on $[0, 1]$, independent of the other marks and of the point process $\mathcal{X}$. The points are listed in increasing order of mark, i.e., the marks represent time of arrival. With this ordering, we connect each point of $\mathcal{X}$ to the nearest point that precedes it in the ordering, to obtain a graph which we also call the on-line nearest neighbour graph on the marked point set $\mathcal{X}$. This definition extends to infinite but locally finite point sets.
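Both definitions of the on-line NNG are easy to state in code. The following sketch builds the graph from a sequence and from a marked point set; the function names are hypothetical, and breaking ties in favour of the lexicographically smallest candidate is an assumption, since the text only says that the lexicographic ordering is used.

```python
import math
import random

def online_nng(points):
    """Edges (i, j): the i-th arrival is joined to its nearest earlier point j < i.
    Ties are broken in favour of the lexicographically smallest point (assumed)."""
    edges = []
    for i in range(1, len(points)):
        j = min(range(i), key=lambda k: (math.dist(points[i], points[k]), points[k]))
        edges.append((i, j))
    return edges

def online_nng_marked(marked_points):
    """Marked version: sort the (mark, point) pairs by mark, then build the graph."""
    ordered = [p for _, p in sorted(marked_points)]
    return ordered, online_nng(ordered)

rng = random.Random(0)
seq = [(rng.random(), rng.random()) for _ in range(50)]
edges = online_nng(seq)
print(len(edges) == len(seq) - 1)        # a tree on n vertices has n - 1 edges
print(all(j < i for i, j in edges))      # each outgoing edge points back in time

# Marks that increase along the sequence reproduce the same graph.
marked = [(k / len(seq), p) for k, p in enumerate(seq)]
print(online_nng_marked(marked)[1] == edges)
```

Connectivity follows by induction, since every point after the first is attached to an earlier point; together with the edge count this gives the tree property.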

Clearly the on-line NNG on $\mathcal{U}_{m,t}$ (defined via marked point processes) has the same distribution as the on-line NNG (with the first definition) on a sequence $X_1, \ldots, X_m$ of independent uniform points on $tB_0$. Likewise, the on-line NNG for $\mathcal{P}_{\lambda,t}$ (defined via marked point processes) has the same distribution as the on-line NNG on $(X_1, \ldots, X_N)$ with $\{X_i\}$ independent uniform points on $tB_0$ and $N$ an independent Poisson variable with parameter $\lambda t^d |B_0|$.

As it turns out, the on-line nearest neighbour graph is a nice example of our methods because it is stabilizing but has only polynomially decaying correlations (i.e., a polynomially decaying tail on the distribution of the radius of stabilization). In the following discussion, although we think of the graph as undirected, we shall refer to an edge connecting marked points $X$ and $Y$ with $T_X > T_Y$, as an outgoing edge from $X$ and an incoming edge to $Y$. Each vertex (except one if $\mathcal{X}$ is finite) has a single outgoing edge.

The existence of an almost surely finite radius of stabilization satisfying (2.28) and (2.29) will be shown later on. To see that its distribution does not have an exponentially decaying tail, let $L$ be the length of the outgoing edge from the origin in the on-line nearest neighbour graph on $\mathcal{P}^0$. With $\pi_d$ denoting the volume of the unit ball in $\mathbb{R}^d$, we have

\[ P[L > r] = \int_0^1 \exp(-\lambda \pi_d r^d t)\,dt = \lambda^{-1} \pi_d^{-1} r^{-d} \int_0^{\lambda \pi_d r^d} e^{-u}\,du, \]

which shows that the tail of the distribution of $L$ decays only polynomially, and $L$ is clearly a lower bound for any radius of stabilization.
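The polynomial tail can be checked numerically. The sketch below compares the integral over the mark $t$ with its closed form $(1 - e^{-c})/c$, where $c = \lambda \pi_d r^d$, and confirms that $r^d\,P[L > r]$ approaches $1/(\lambda \pi_d)$ for large $r$. The function names and the midpoint-rule quadrature are illustrative choices for this example only.

```python
import math

def unit_ball_volume(d):
    """Volume of the unit ball in R^d."""
    return math.pi ** (d / 2) / math.gamma(d / 2 + 1)

def tail_closed_form(r, lam, d):
    """P[L > r] = (1 - exp(-c)) / c with c = lam * pi_d * r^d."""
    c = lam * unit_ball_volume(d) * r ** d
    return -math.expm1(-c) / c

def tail_by_quadrature(r, lam, d, steps=100_000):
    """Midpoint rule for the integral of exp(-c t) over t in [0, 1]."""
    c = lam * unit_ball_volume(d) * r ** d
    h = 1.0 / steps
    return h * sum(math.exp(-c * (k + 0.5) * h) for k in range(steps))

lam, d = 1.0, 2
print(tail_closed_form(1.5, lam, d), tail_by_quadrature(1.5, lam, d))
# Polynomial decay: r^d * P[L > r] * lam * pi_d tends to 1 as r grows.
print(10.0 ** d * tail_closed_form(10.0, lam, d) * lam * unit_ball_volume(d))
```

The second printed value being close to 1 shows that the tail behaves like a constant times $r^{-d}$ rather than decaying exponentially.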

Recall the definitions of $V^G_\psi$, $L^G_\phi$, and $B(\mathcal{K}_k)$ from Section 2.3. The following result says that for certain $\phi$, the totals of $\phi$-weighted edges of the on-line nearest neighbour graph on random points in disjoint regions, scaled and centred, are asymptotically independent normals, and likewise for the totals of any bounded function of vertex degrees summed over vertices in disjoint regions.

Theorem 3.6 Suppose $G(\mathcal{X})$ is the on-line NNG on $\mathcal{X}$. Let $\lambda > 0$ and suppose $B_0 \subset \mathbb{R}^d$ is convex and bounded with $|B_0| > 0$. For any $\psi \in B(\mathcal{K}_1)$, if we set $h(\mathcal{X}, A) = V^G_\psi(\mathcal{X}, A)$ then $h$ satisfies all the conditions (2.13), (2.14), (2.16), (2.17), and (2.25) (with $\gamma = 0$) of Theorems 2.2, 2.3 and 2.4, and therefore satisfies their conclusions (with $h^j(\mathcal{X}, A) = V^G_\psi(\mathcal{X}, A)$ for all $j$ and with $\gamma = 0$).

Suppose instead that we set $h(\mathcal{X}, A) = L^G_\phi(\mathcal{X}, A)$ for some $\phi : (0, \infty) \to \mathbb{R}$. Then the stabilization conditions (2.13) and (2.14) hold. If $\phi$ satisfies the growth bound

\[ \sup_{r > 0} \left((1 + r)^{-\alpha} |\phi(r)|\right) < \infty, \quad \text{some } \alpha < d/4, \tag{3.6} \]

then the moments condition (2.16) holds and so Theorems 2.2 and 2.3 apply and their conclusions (with $h^j = h$ for all $j$) hold.

If also $\phi(r) = r^\alpha$ for some constant $\alpha < d/4$, then the homogeneity hypothesis (2.25) holds and hence the conclusions (2.26) and (2.27) of Theorem 2.4 are valid with $\gamma = \alpha$.

Remarks. Provided $B_0$ is convex, Theorem 3.6 gives, among other things, a central limit theorem for the number of vertices of any fixed degree in the on-line NNG on $\mathcal{U}_{n,1}$ or on $\mathcal{P}_{\lambda,t}$. Since any bounded function of the edge lengths satisfies the growth bound (3.6), Theorem 3.6 also enables us to obtain functional central limit theorem results on the empirical distributions of edge lengths in the on-line nearest neighbour graph, similar to those described in the preceding section for the minimal spanning tree.

Provided $d > 4$, Theorem 3.6 gives us a central limit theorem for the total length of the on-line NNG on $\mathcal{U}_{n,1}$ or on $\mathcal{P}_{\lambda,t}$ (since the function $\phi(r) = r$ satisfies the growth bound (3.6)). This leaves open the question of the asymptotic behaviour of the total length of the on-line NNG on $\mathcal{U}_{n,1}$, in dimensions $d \le 4$. As mentioned earlier, it is likely that the 4th moments condition (2.16) can be replaced by a $2 + \varepsilon$ moments condition in Theorems 2.2 and 2.3. If this can be done, the total length of the on-line NNG on $\mathcal{U}_{n,1}$ will satisfy a central limit theorem for $d = 4$ or $d = 3$. We suspect that a central limit theorem also holds for $d = 2$, but do not have a proof. We believe that the limiting distribution of the (centred) total length is non-normal for $d = 1$; Penrose and Wade [17] have shown this to be the case for a related graph in which $X_i$ is joined to its nearest neighbour to the left in the set $\{X_1, \ldots, X_{i-1}\}$.

Theorem 3.6 also carries through to the on-line $k$-nearest neighbour graph, although we give a proof only in the case $k = 1$.

The proof of Theorem 3.6 uses the following three lemmas. The first two of these are purely geometric in nature. Given distinct points $x, y \in \mathbb{R}^d$, let $C_{x,y}$ denote the cone with its point at $x$ and with angular radius $\pi/12$, centred on the half-line from $x$ passing through $y$.

Lemma 3.2 Suppose $B_0 \subset \mathbb{R}^d$ is convex and bounded with $|B_0| > 0$. Then

\[ \inf_{x, y \in B_0 :\, x \ne y} \frac{|C_{x,y} \cap B_{|y-x|}(x) \cap B_0|}{|y - x|^d} > 0. \]

Proof. Take $x, y \in B_0$ with $|y - x| = r > 0$. Let $z = (x + y)/2$. By convexity $z \in B_0$, and geometrical considerations show that

\[ B_{(r/2)\sin(\pi/12)}(z) \subseteq C_{x,y} \cap B_r(x), \]

so the result follows from Lemma 3.1. ■

Lemma 3.3 Suppose $C$ is an open cone in $\mathbb{R}^d$, of angular radius $\pi/6$, with its point at $x \in \mathbb{R}^d$. Then for $y \in C$ and $z \in C$ we have $|z - y| < \max(|z - x|, |y - x|)$.

Proof. Assume without loss of generality that $x = 0$ and that $|y| \le |z|$. Let $\theta$ be the angle between $z$ and $y$ at the origin. Then $\theta < \pi/3$, so $\cos\theta > 1/2$ and by the cosine rule,

\[ |z - y|^2 = |z|^2 + |y|^2 - 2|y| \cdot |z| \cos\theta < |z|^2. \]

■
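As a quick sanity check on the cosine-rule argument, one can sample pairs of points from an open cone of angular radius $\pi/6$ with apex at the origin and record the largest observed value of $|z - y| / \max(|z|, |y|)$. The rejection sampler and function names below are illustrative assumptions for this sketch only.

```python
import math
import random

def norm(v):
    return math.sqrt(sum(c * c for c in v))

def sample_in_cone(rng, axis, half_angle):
    """Rejection-sample a nonzero point of the open cone with apex 0 around `axis`."""
    while True:
        v = [rng.uniform(-1.0, 1.0) for _ in axis]
        r = norm(v)
        if r == 0.0:
            continue
        cos_angle = sum(a * b for a, b in zip(v, axis)) / (r * norm(axis))
        if cos_angle > math.cos(half_angle):
            return v

def worst_ratio(d=3, pairs=2000, seed=0):
    """Largest observed |z - y| / max(|z|, |y|) over sampled pairs in the cone."""
    rng = random.Random(seed)
    axis = [1.0] + [0.0] * (d - 1)
    worst = 0.0
    for _ in range(pairs):
        y = sample_in_cone(rng, axis, math.pi / 6)
        z = sample_in_cone(rng, axis, math.pi / 6)
        gap = norm([a - b for a, b in zip(z, y)])
        worst = max(worst, gap / max(norm(y), norm(z)))
    return worst

print(worst_ratio())  # stays strictly below 1, as the lemma asserts
```

The ratio can approach 1 only when the two points are nearly equidistant from the apex and nearly $\pi/3$ apart in angle, which is the extremal case in the proof.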



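Recall from Section 2.3 that if a graph $G(\mathcal{X})$ is defined for locally finite point sets $\mathcal{X} \subset \mathbb{R}^d$, then for $x \in \mathcal{X}$, $\mathcal{E}^+(x; \mathcal{X})$ denotes the set of edges of $G(\mathcal{X})$ which are not edges of $G(\mathcal{X} \setminus \{x\})$, and $\mathcal{E}^-(x; \mathcal{X})$ denotes the set of edges of $G(\mathcal{X} \setminus \{x\})$ which are not edges of $G(\mathcal{X})$. Also, $|e|$ denotes the Euclidean length of edge $e$.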

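Lemma 3.4 Let $G(\mathcal{X})$ denote the on-line nearest neighbour graph on the marked point set $\mathcal{X}$, and suppose $B_0 \subset \mathbb{R}^d$ is convex and bounded with $|B_0| > 0$. Let $\lambda > 0$ and let $\mu_{\lambda,t} := \lambda t^d |B_0|$. Let $0 \le \alpha < d/4$. Then

\[ \sup_{t \ge 1,\ x \in tB_0}\ \sup_{m \in [\mu_{\lambda,t}/2,\ 3\mu_{\lambda,t}/2] \cap \mathbb{Z}} E\left[\left(\sum_{e \in \mathcal{E}^+(x;\ \mathcal{U}_{m,t} \cup \{x\})} (1 + |e|)^\alpha\right)^4\right] < \infty \tag{3.7} \]

and

\[ \sup_{t \ge 1,\ x \in tB_0}\ \sup_{m \in [\mu_{\lambda,t}/2,\ 3\mu_{\lambda,t}/2] \cap \mathbb{Z}} E\left[\left(\sum_{e \in \mathcal{E}^-(x;\ \mathcal{U}_{m,t} \cup \{x\})} (1 + |e|)^\alpha\right)^4\right] < \infty. \tag{3.8} \]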



Proof. Fix $t$, $m$ and $x$ with $t \ge 1$, $m/\mu_{\lambda,t} \in [1/2, 3/2]$, and $x \in tB_0$. Take cones $C_1, C_2, \ldots, C_K$, each with angular radius $\pi/12$ with point at $x$, and with union $\mathbb{R}^d$, where $K$ is a constant depending only on $d$. For $1 \le i \le K$, let $C_i^+$ be the cone of angular radius $\pi/6$ with point at $x$, concentric to $C_i$. Let $1 \le i \le K$, and let the random variable $R_i$ be defined as follows:

• If there exists a point $Z$ of $\mathcal{U}_{m,t}$ lying in the cone $C_i$, and carrying a mark $T_Z < T_x$, let $R_i$ be the distance from $x$ to the nearest point $Y$ of $\mathcal{U}_{m,t}$ lying in the cone $C_i^+$, and carrying a mark $T_Y < T_x$.

• If no such point $Z$ exists, set $R_i$ to be $\sup_{y \in tB_0 \cap C_i} |y - x|$, the furthest distance from $x$ to any element of $tB_0 \cap C_i$.

By Lemma 3.3, all incoming edges to $x$ from points of $\mathcal{U}_{m,t}$ in $C_i$ must be from points at a distance at most $R_i$ from $x$.

Let $\mathcal{Y}_i$ be the set of points of $\mathcal{U}_{m,t}$ lying in the cone $C_i$ at a distance less than $R_i$ from $x$. By definition of $R_i$, necessarily all points of $\mathcal{Y}_i$ carry a mark greater than $T_x$. Listing the points of $\mathcal{Y}_i$ as $Y_{i,1}, \ldots, Y_{i,\nu(i)}$, in order of increasing mark, let $M_i$ be the number of points $Y_{i,j}$ of $\mathcal{Y}_i$ such that $Y_{i,j}$ lies closer to $x$ than do any of the points $Y_{i,1}, \ldots, Y_{i,j-1}$ (we include $Y_{i,1}$ in this set of points). By Lemma 3.3, each incoming edge at $x$ with an endpoint in the cone $C_i$ is from such a point, so that

\[ \left(\sum_{e \in \mathcal{E}^+(x;\ \mathcal{U}_{m,t} \cup \{x\})} (1 + |e|)^\alpha\right)^4 \le \left(\sum_{i=1}^K (1 + M_i)(1 + R_i)^\alpha\right)^4 \le K^3 \sum_{i=1}^K (1 + M_i)^4 (1 + R_i)^{4\alpha}. \tag{3.9} \]

When $x$ is inserted into $\mathcal{U}_{m,t}$, all removed edges are outgoing from points that are connected to $x$ after it is inserted. Hence, the removed edges outgoing from points in $C_i$ are the edges outgoing from the points in the set $\{Y_{i,1}, \ldots, Y_{i,\nu(i)}\}$ which are connected to $x$ after insertion of $x$. Since these points lie in $C_i \cap B_{R_i}(x)$, by Lemma 3.3 they lie within distance at most $R_i$ of each other, so that the removed edges outgoing from the points in $C_i$ are of length at most $R_i$, with the possible sole exception of the outgoing edge from $Y_{i,1}$.

Also, we assert that the removed edge from $Y_{i,1}$ (if there is one) has length at most $2 \max_{1 \le \ell \le K} R_\ell$. To see this, note that if $x$ carries a lower mark than any point of $\mathcal{U}_{m,t}$, then all of $tB_0$ lies within distance $\max_{1 \le \ell \le K} R_\ell$ of $x$; if not, then for some $\ell \le K$ there is a point of $\mathcal{U}_{m,t}$ in $C_\ell^+$ at distance $R_\ell$ from $x$ carrying a mark which is lower than $T_x$, and hence also lower than $T_{Y_{i,1}}$; the assertion follows.

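By the preceding remarks about removed edges, it follows that

\[ \left(\sum_{e \in \mathcal{E}^-(x;\ \mathcal{U}_{m,t} \cup \{x\})} (1 + |e|)^\alpha\right)^4 \le \left(K (1 + 2 \max_i R_i)^\alpha + \sum_{i=1}^K M_i (1 + R_i)^\alpha\right)^4 \le \left(\sum_{i=1}^K (1 + R_i)^\alpha (2^\alpha K + M_i)\right)^4 \le K^3 \sum_{i=1}^K (2^\alpha K + M_i)^4 (1 + R_i)^{4\alpha}. \tag{3.10} \]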

Conditional on the set of points $\mathcal{Y}_i$ (but not their marks), any of the $\nu(i)!$ possible orderings of the marks of points of $\mathcal{Y}_i$ is equally likely. Note that $\binom{M_i}{4}$ is the number of collections of four distinct points $Y_{i,j_1}, \ldots, Y_{i,j_4}$ such that each of $Y_{i,j_k}$, $1 \le k \le 4$, lies closer to $x$ than any point of $Y_{i,1}, \ldots, Y_{i,j_k - 1}$.

Given $\nu(i) = \ell$, for any $j_1 < j_2 < j_3 < j_4 \le \ell$, the probability that each of $Y_{i,j_k}$, $1 \le k \le 4$, lies closer to $x$ than any point of $Y_{i,1}, \ldots, Y_{i,j_k - 1}$ is equal to $1/(j_1 j_2 j_3 j_4)$. Hence,

\[ E\left[\binom{M_i}{4} \,\Big|\, \nu(i) = \ell\right] = \sum_{1 \le j_1 < j_2 < j_3 < j_4 \le \ell} \frac{1}{j_1 j_2 j_3 j_4} \le \frac{(1 + \log \ell)^4}{4!}, \quad \ell > 0, \]

and since trivially

\[ M_i^4 \le 256 \binom{M_i}{4} \mathbf{1}\{M_i \ge 4\} + 81 \cdot \mathbf{1}\{M_i < 4\} \le 256 \binom{M_i}{4} + 81, \]

we obtain

\[ E[M_i^4 \mid \nu(i) = \ell] \le 81 + 11 (1 + \log \ell)^4, \quad \ell > 0. \]
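The combinatorial bounds in this step can be verified exactly for small $\ell$. In the sketch below, $M$ is modelled as the number of running minima of a uniformly random ordering of $\ell$ distinct distances; this modelling choice and the function names are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import combinations, permutations
from math import comb, log

def harmonic_quadruple_sum(l):
    """Sum of 1/(j1 j2 j3 j4) over 1 <= j1 < j2 < j3 < j4 <= l, as an exact fraction."""
    return sum(Fraction(1, a * b * c * e)
               for a, b, c, e in combinations(range(1, l + 1), 4))

def mean_quadruples_by_enumeration(l):
    """Average of C(M, 4) over all l! orderings, M = number of running minima."""
    total, count = Fraction(0), 0
    for order in permutations(range(l)):
        records, best = 0, l
        for v in order:
            if v < best:          # a new closest point so far
                records, best = records + 1, v
        total += comb(records, 4)
        count += 1
    return total / count

print(harmonic_quadruple_sum(5), mean_quadruples_by_enumeration(5))
print(all(harmonic_quadruple_sum(l) <= (1 + log(l)) ** 4 / 24 for l in range(1, 26)))
print(all(m ** 4 <= 256 * comb(m, 4) + 81 for m in range(0, 501)))
```

The constant 11 in the final bound then comes from $256/4! < 11$.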

Choose $\varepsilon \in (0, 1)$ with $4\alpha + d\varepsilon < d$. Conditional on $\nu(i)$, the distribution of $M_i$ does not depend on the value of $R_i$, so that for some constant $c_1 > 0$,

\[ E[M_i^4 \mid \nu(i), R_i] \le 81 + 11 (1 + \log \nu(i))^4 \mathbf{1}\{\nu(i) > 0\} \le c_1 (1 + \nu(i)^\varepsilon). \]

Next, we assert that the conditional distribution of $\nu(i)$, given $R_i = r$, is stochastically dominated by the Binomial $\mathrm{Bi}(m, \pi_d r^d/(t^d |B_0|))$, where for $p > 1$ we set $\mathrm{Bi}(n, p) := \mathrm{Bi}(n, 1)$. To see this, let $\mathcal{U}^-_{m,t}$ denote the set of points of $\mathcal{U}_{m,t}$ which carry a mark less than $T_x$, and let $\mathcal{U}^+_{m,t} := \mathcal{U}_{m,t} \setminus \mathcal{U}^-_{m,t}$. Let $N := \mathrm{card}(\mathcal{U}^-_{m,t})$. Then

\[ \nu(i) = \mathrm{card}(\mathcal{U}^+_{m,t} \cap B_{R_i}(x) \cap C_i), \]

while the value of $R_i$ is determined by the configuration of $\mathcal{U}^-_{m,t}$. The conditional distribution of $\mathcal{U}^+_{m,t}$, given $\mathcal{U}^-_{m,t}$, is that of $m - N$ points independently uniformly distributed in $tB_0$ (thus, this conditional distribution depends on $\mathcal{U}^-_{m,t}$ only through the value of $N$). Hence, given $R_i = r$ and $N = n$, the conditional distribution of $\nu(i)$ is binomial

\[ \mathrm{Bi}(m - n,\ |B_r(x) \cap C_i \cap tB_0| / |tB_0|), \]

and since all possible values of $m - N$ are at most $m$, this conditional distribution is stochastically dominated by $\mathrm{Bi}(m, \pi_d r^d/(t^d |B_0|))$, justifying the assertion above.
By the preceding assertion, since we assume $m \le 2\lambda t^d |B_0|$, we have

\[ E[M_i^4 \mid R_i] \le c_1 E\left[1 + \left(\mathrm{Bi}(m, \pi_d R_i^d/(t^d |B_0|))\right)^\varepsilon\right] \le c_1 \left(1 + (2\lambda \pi_d R_i^d)^\varepsilon\right) \]

by Jensen's inequality. Hence, for suitable $c_2$,

\[ E[(1 + M_i)^4 (1 + R_i)^{4\alpha}] \le c_2\, E[\max(1, R_i^{4\alpha + d\varepsilon})]. \tag{3.11} \]

For any $r > 0$, we have $P[R_i > r] = 0$ unless there exists $y \in tB_0 \cap C_i$ with $|y - x| = r$, in which case by Lemma 3.2

\[ |C_i^+ \cap B_r(x) \cap tB_0| \ge |C_{x,y} \cap B_r(x) \cap tB_0| \ge c_3 r^d, \]

for some constant $c_3 > 0$ depending only on $B_0$. Hence, by conditioning on the value of $T_x$ we have for large enough $s$ that

\[ P[R_i^{4\alpha + d\varepsilon} > s] \le \int_0^1 \exp\left(-m c_3 u\, s^{d/(4\alpha + d\varepsilon)}/(t^d |B_0|)\right) du, \]

and since we assume $m \ge \lambda t^d |B_0|/2$, this is bounded by a constant times $s^{-d/(4\alpha + d\varepsilon)}$. Since $d/(4\alpha + d\varepsilon) > 1$, this bound is integrable in $s$ at infinity. Hence, there is a constant $c_4$ such that

\[ \int_0^\infty P[R_i^{4\alpha + d\varepsilon} > s]\,ds \le c_4. \]

Hence, using (3.9) and (3.11) we obtain (3.7). The proof of (3.8) using (3.10) is similar. ■

Proof of Theorem 3.6. We assert that there exists an almost surely finite radius of stabilization $R$ satisfying (2.28) and (2.29). To see this, take a finite collection of cones $C_i^*$ with point at 0 and angular radius $\pi/6$, with union $\mathbb{R}^d$; let $R_i^*$ be the distance from 0 to the nearest Poisson point in $C_i^*$ to 0 with a lower mark than $T_0$. It is not hard to see that $R_i^*$ is almost surely finite. Then by Lemma 3.3, no point placed in $C_i^*$ at a distance greater than $R_i^*$ from 0 will be connected to 0 in the on-line NNG. Also, by Lemma 3.3 again, any Poisson point in $C_i^* \cap B_{R_i^*}(0)$ and carrying a higher mark than $T_0$ has a lower-marked Poisson point within distance at most $R_i^*$, even before addition of a point at 0, so that its nearest lower-marked neighbour (before insertion of 0) lies in $B_{2R_i^*}(0)$. Hence the set of edges added or removed upon insertion of a point at the origin is unaffected by changes to $\mathcal{P}_\lambda$ outside $B_{2\max_i R_i^*}(0)$; in other words, $2\max_i R_i^*$ is a radius of stabilization in the sense of (2.28), (2.29). Thus we can apply Lemma 2.1 to get the conditions (2.13) and (2.14) for either $h(\mathcal{X}, A) = V^G_\psi(\mathcal{X}, A)$ or $h(\mathcal{X}, A) = L^G_\phi(\mathcal{X}, A)$.

The case a = of Lemma 13.41 gives us the condition (|2.16|) for the functional 
h(X, A) = Vfi(X, A) for any ip G B()Ci). Also, the uniform bound (|2~T7|) is obvious 
for any such h, and by scale invariance of the on-line NNG, the homogeneity condi- 
tion ()2.25|) with 7 = also holds. Thus Theorems 12.21 12.31 and 12.41 are all applicable 
in this case. 

Turning to the case where $h(X, A) = L^{\phi}(X,A)$, with $\phi$ satisfying the growth 
bound (3.6), once again Lemma 3.4 gives us the condition (2.16). Also, (2.17) is 
again obvious in this case, so that Theorems 2.2 and 2.3 are applicable in this case. 
By scale invariance the homogeneity condition (2.25) holds (with $\gamma = \alpha$) for the 
case $\phi(r) = r^{\alpha}$, so that Theorem 2.4 is also applicable in this case. $\square$
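
The on-line nearest-neighbour graph used in this proof can be illustrated numerically. The following is a minimal Python sketch, not taken from the paper: it assumes the points are supplied already sorted by increasing mark, and the function names `online_nng_edges` and `power_weighted_length` are hypothetical. It joins each point to its nearest predecessor and sums the edge lengths raised to a power, which makes the scale-invariance property behind the homogeneity condition easy to check.

```python
import math

def online_nng_edges(points):
    """Join each point (in arrival order) to its nearest predecessor.

    `points` is a list of coordinate tuples, assumed sorted by increasing
    mark (arrival time). Returns a list of (child, parent, length) triples.
    """
    edges = []
    for i in range(1, len(points)):
        # Nearest point among those that arrived earlier (lower mark).
        parent = min(range(i), key=lambda j: math.dist(points[i], points[j]))
        edges.append((i, parent, math.dist(points[i], points[parent])))
    return edges

def power_weighted_length(points, alpha):
    """Sum of edge lengths raised to the power alpha, i.e. phi(r) = r**alpha."""
    return sum(length ** alpha for _, _, length in online_nng_edges(points))
```

Scaling all points by a factor $a$ leaves the edge set unchanged and multiplies the power-weighted length by $a^{\alpha}$, which is the homogeneity of order $\alpha$ invoked above; with $\alpha = 0$ the functional simply counts edges.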



4 Proof of the general CLT for lattice systems 

Assume throughout this section that $X = (X_z, z \in \mathbb{Z}^d)$ is as described in Section 2.2. 
Assume also that $B_0 \in \mathcal{R}(\mathbb{R}^d)$ satisfying $|B_0| > 0$ is fixed, and that for $i = 1, 2, \ldots, k$, 
$(H^i_t(A), t \geq 1, A \in \mathcal{R}(B_0))$ is a random set function on $B_0$ as described in Section 
2.2, satisfying the stabilization conditions (2.4) and (2.5) along with the moments 
condition (2.6) for some $\gamma > 2$. Assume also that $(t_n)_{n \geq 1}$ is an arbitrary 
$[1,\infty)$-valued sequence which tends to infinity as $n \to \infty$. 

For $y \in \mathbb{Z}^d$ and $i \in \{1,2,\ldots,k\}$, since the shifted family of i.i.d. variables $\tau_y X$ 
has the same joint distribution as $X$, by (2.4) there exists a random variable $\Delta^{i,y}_\infty$ 
such that for $A \in \mathcal{R}(B_0)$ and $z_n \in \mathbb{Z}^d$, $n \geq 1$, with $\liminf_{n\to\infty}(\tau_{z_n}(t_n A)) = \mathbb{R}^d$, we 
have 

$$H^i_{t_n,z_n}(\tau_y X, A) - H^i_{t_n,z_n}((\tau_y X)^0, A) \xrightarrow{P} \Delta^{i,y}_\infty. \tag{4.1}$$

In other words, $\Delta^{i,y}_\infty$ is defined in just the same manner as $\Delta^{i}_\infty$ at (2.4) but using 
the shifted family of i.i.d. variables $\tau_y X$. 
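For $y \in \mathbb{Z}^d$, let 

$$F^i_y := E[\Delta^{i,y}_\infty \mid \mathcal{F}_y]. \tag{4.2}$$

By the conditional Jensen inequality, Fatou's lemma, and the moments condition (2.6), 

$$E[(F^i_y)^2] = E[(E[\Delta^{i,y}_\infty \mid \mathcal{F}_y])^2] \leq E[(\Delta^{i,y}_\infty)^2] = E[(\Delta^{i}_\infty)^2] < \infty. \tag{4.3}$$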

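Lemma 4.1 Let $A \in \mathcal{R}(B_0)$, with $|A| > 0$, and let $i,j \in \{1, 2, \ldots, k\}$. Then 

$$(t_n^d |A|)^{-1} \sum_{y \in (t_n A) \cap \mathbb{Z}^d} F^i_y F^j_y \xrightarrow{L^1} E[E(\Delta^i_\infty \mid \mathcal{F}_0)\, E(\Delta^j_\infty \mid \mathcal{F}_0)]. \tag{4.4}$$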

Proof. Since $\mathcal{F}_y$ is the $\sigma$-field generated by $((\tau_y X)_z)_{z \preceq 0}$, the definition of $F^i_y F^j_y$ in 
terms of $\tau_y X$ is the same as that of $F^i_0 F^j_0$ in terms of $X$. Hence the random field 
$(F^i_y F^j_y, y \in \mathbb{Z}^d)$ is a stationary family of random variables. Also, each variable $F^i_y$ 
has finite second moment by (4.3), and likewise for $F^j_y$, so that $|F^i_y F^j_y|$ has finite 
first moment by the Cauchy-Schwarz inequality. 

Also, the $\sigma$-field of translation-invariant $\sigma(X)$-measurable events is trivial (see 
Durrett [3], chapter 6, lemma 4.3). 

The result follows from the classical Ergodic Theorem ([3], chapter 6, section 2). 
For details, see the proof of eqn (2.8) of [13]. In the terminology of [13], the sequence 
of sets $(t_n A)_{n \geq 1}$ has vanishing relative boundary because of the assumption that $A$ 
is Riemann measurable. This assumption also implies that $\mathrm{card}(t_n A \cap \mathbb{Z}^d) \sim t_n^d |A|$ 
as $n \to \infty$. $\square$

For $y \in \mathbb{Z}^d$, let $X^y$ be the random field $X$ with the value $X_y$ at site $y$ replaced 
by the independent copy $X_*$ (i.e., $X^y = (X^y_z, z \in \mathbb{Z}^d)$ with $X^y_y = X_*$ and $X^y_z = X_z$ 
for $z \neq y$). For $t > 0$, and $1 \leq i \leq k$, set 

$$\Delta^i_{t,y}(A) := H^i_t(X,A) - H^i_t(X^y,A). \tag{4.5}$$

Observe that $X^y = \tau_{-y}((\tau_y X)^0)$, so that 

$$\Delta^i_{t,y}(A) = H^i_{t,-y}(\tau_y X,A) - H^i_{t,-y}((\tau_y X)^0,A). \tag{4.6}$$

Therefore by the definition (2.3), since the translated random field $\tau_y X$ has the same 
distribution as $X$, 

$$\Delta^i_{t,y}(A) \overset{\mathcal{D}}{=} H^i_{t,-y}(X,A) - H^i_{t,-y}(X^0,A). \tag{4.7}$$

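Lemma 4.2 Let $A \in \mathcal{R}(B_0)$, and $i \in \{1, 2, \ldots, k\}$. Suppose $(y_n)_{n \geq 1}$ is a $\mathbb{Z}^d$-valued 
sequence. Then 

$$\lim_{n\to\infty} E[(\Delta^i_{t_n,y_n}(A) - \Delta^{i,y_n}_\infty)^2] = 0 \quad \text{if } \liminf_{n\to\infty}(\tau_{-y_n}(t_n A)) = \mathbb{R}^d \tag{4.8}$$

and 

$$\lim_{n\to\infty} E[(\Delta^i_{t_n,y_n}(A))^2] = 0 \quad \text{if } \liminf_{n\to\infty}(\tau_{-y_n}(t_n(B_0 \setminus A))) = \mathbb{R}^d. \tag{4.9}$$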

Proof. The second limiting expression (4.9) follows from the distributional identity 
(4.7) along with the second stabilization condition (2.5) and the moments condition 
(2.6) (see [21], A13.2(f)). 

To prove (4.8), observe that since $\Delta^{i,y}_\infty$ is defined in terms of $\tau_y X$ in the same 
manner as $\Delta^i_\infty$ is defined in terms of $X$, we have by (4.6) that 

$$E[(\Delta^i_{t_n,y_n}(A) - \Delta^{i,y_n}_\infty)^2] = E[(H^i_{t_n,-y_n}(\tau_{y_n}X, A) - H^i_{t_n,-y_n}((\tau_{y_n}X)^0, A) - \Delta^{i,y_n}_\infty)^2]$$

$$= E[(H^i_{t_n,-y_n}(X, A) - H^i_{t_n,-y_n}(X^0, A) - \Delta^i_\infty)^2].$$

If $\liminf_{n\to\infty}(\tau_{-y_n}(t_n A)) = \mathbb{R}^d$, then this tends to zero as $n \to \infty$ by the stabilization 
and moments conditions (2.4) and (2.6) (again see [21], A13.2(f)). $\square$

Recalling the definition of A at (|2.1|) . define the sequence of sets (B n ) n >i in Z d 

by 

5 n := (4.10) 
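For $i \in \{1,2,\ldots,k\}$, $y \in \mathbb{Z}^d$, $t \geq 1$, and $A \in \mathcal{R}(B_0)$, let 

$$F^i_{t,y}(A) := E[\Delta^i_{t,y}(A) \mid \mathcal{F}_y]. \tag{4.11}$$

For $\gamma \geq 1$, the conditional Jensen inequality implies that 

$$E[|F^i_{t,y}(A)|^\gamma] = E[|E[\Delta^i_{t,y}(A) \mid \mathcal{F}_y]|^\gamma] \leq E[|\Delta^i_{t,y}(A)|^\gamma] \tag{4.12}$$

and therefore the distributional identity (4.7) together with the moments condition 
(2.6) imply that for some $\gamma > 2$, 

$$\sup\{E[|F^i_{t,y}(A)|^\gamma] : A \in \mathcal{R}(B_0),\ t \geq 1,\ y \in tB_0\} < \infty. \tag{4.13}$$

Lemma 4.3 For any $A \in \mathcal{R}(B_0)$, $A' \in \mathcal{R}(B_0)$, and any $i,j \in \{1, \ldots, k\}$, as $n \to \infty$ 
we have 

$$t_n^{-d} \sum_{x \in t_n(A \cap A') \cap \mathbb{Z}^d} \big(F^i_{t_n,x}(A)F^j_{t_n,x}(A') - F^i_x F^j_x\big) \xrightarrow{L^1} 0 \tag{4.14}$$

and 

$$t_n^{-d} \sum_{x \in B_n \setminus t_n(A \cap A')} F^i_{t_n,x}(A)F^j_{t_n,x}(A') \xrightarrow{L^1} 0. \tag{4.15}$$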

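Proof. By the triangle and Cauchy-Schwarz inequalities, 

$$E[|F^i_{t_n,x}(A)F^j_{t_n,x}(A') - F^i_x F^j_x|] \leq (E[(F^i_{t_n,x}(A))^2])^{1/2}(E[(F^j_{t_n,x}(A') - F^j_x)^2])^{1/2}$$
$$\qquad + (E[(F^i_{t_n,x}(A) - F^i_x)^2])^{1/2}(E[(F^j_x)^2])^{1/2}. \tag{4.16}$$

By (4.13) and (4.3), $E[(F^i_{t_n,x}(A))^2]$ and $E[(F^j_x)^2]$ are uniformly bounded. Moreover, 
by definitions (4.2), (4.11) and by the conditional Jensen inequality, 

$$E[(F^i_{t_n,x}(A) - F^i_x)^2] = E[(E[\Delta^i_{t_n,x}(A) - \Delta^{i,x}_\infty \mid \mathcal{F}_x])^2]$$
$$\leq E[E[(\Delta^i_{t_n,x}(A) - \Delta^{i,x}_\infty)^2 \mid \mathcal{F}_x]] = E[(\Delta^i_{t_n,x}(A) - \Delta^{i,x}_\infty)^2] \tag{4.17}$$

and similarly, 

$$E[(F^j_{t_n,x}(A') - F^j_x)^2] \leq E[(\Delta^j_{t_n,x}(A') - \Delta^{j,x}_\infty)^2]. \tag{4.18}$$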

For $A \in \mathcal{R}(B_0)$, define 'interior' and 'exterior' lattice sets for the set $t_n A$ by 

$$\mathrm{int}_n(A) := \{z \in \mathbb{Z}^d : B_{1/2}(z) \subset t_n A\}; \qquad \mathrm{ext}_n(A) := \mathrm{int}_n(B_0 \setminus A),$$

and the 'boundary' lattice set 

$$\partial_n(A) := B_n \setminus (\mathrm{int}_n(A) \cup \mathrm{ext}_n(A)),$$

which consists of lattice points near the boundary either of $t_n A$ or of $t_n(B_0 \setminus A)$. 
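
To make the interior, exterior and boundary lattice sets concrete, here is a small Python sketch for axis-parallel boxes. It is only an illustration under stated assumptions: each site $z$ is identified with the closed unit cube centred at $z$, the set $B_n$ is taken to be the sites whose cube meets $t_n B_0$, and the boxes are passed already scaled by $t_n$ with integer endpoints. The function name `lattice_sets` is hypothetical.

```python
from itertools import product

def lattice_sets(outer, inner):
    """Classify lattice sites for boxes already scaled by t_n.

    `outer` and `inner` are boxes [(lo, hi), ...] with integer endpoints,
    standing for t_n*B_0 and t_n*A. Each site z is identified with the
    closed unit cube centred at z (an assumption of this sketch).
    Returns (B_n, interior, exterior, boundary) as sets of integer tuples.
    """
    def inside(z, box):   # cube around z is contained in the box
        return all(2 * lo <= 2 * c - 1 and 2 * c + 1 <= 2 * hi
                   for c, (lo, hi) in zip(z, box))

    def meets(z, box):    # cube around z intersects the box
        return all(2 * c + 1 >= 2 * lo and 2 * c - 1 <= 2 * hi
                   for c, (lo, hi) in zip(z, box))

    sites = product(*[range(lo - 1, hi + 2) for lo, hi in outer])
    b_n = {z for z in sites if meets(z, outer)}
    interior = {z for z in b_n if inside(z, inner)}
    # Exterior: cube lies in the outer box and misses the inner box.
    exterior = {z for z in b_n if inside(z, outer) and not meets(z, inner)}
    boundary = b_n - interior - exterior
    return b_n, interior, exterior, boundary
```

For $B_0 = [0,1]^2$ and $A = [0.2,0.6] \times [0.3,0.7]$, doubling $t_n$ roughly doubles the number of boundary sites while quadrupling $t_n^d$, which is the sense in which the boundary set is small relative to $t_n^d$ for Riemann measurable sets.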
We assert that 

$$\lim_{n\to\infty} \sup_{x \in \mathrm{int}_n(A \cap A')} E[|F^i_{t_n,x}(A)F^j_{t_n,x}(A') - F^i_x F^j_x|] = 0. \tag{4.19}$$

Indeed, if this were untrue we could take a sequence $(x_n)_{n \geq 1}$ with $x_n \in \mathrm{int}_n(A \cap A')$ 
and 

$$\limsup_{n\to\infty} E[|F^i_{t_n,x_n}(A)F^j_{t_n,x_n}(A') - F^i_{x_n}F^j_{x_n}|] > 0.$$

This would imply by (4.16), (4.17) and (4.18) that 

$$\limsup_{n\to\infty} \max\big(E[(\Delta^i_{t_n,x_n}(A) - \Delta^{i,x_n}_\infty)^2],\ E[(\Delta^j_{t_n,x_n}(A') - \Delta^{j,x_n}_\infty)^2]\big) > 0,$$

which contradicts eqn (4.8) from Lemma 4.2. 

By a similar argument to the proof of (4.19), this time using (4.9) and (4.12), 
we obtain 

$$\lim_{n\to\infty} \sup_{x \in \mathrm{ext}_n(A)} E[F^i_{t_n,x}(A)^2] = 0, \qquad \lim_{n\to\infty} \sup_{x \in \mathrm{ext}_n(A')} E[F^j_{t_n,x}(A')^2] = 0. \tag{4.20}$$

By (4.20) and the Cauchy-Schwarz inequality, 

$$\lim_{n\to\infty} \sup_{x \in \mathrm{ext}_n(A) \cup \mathrm{ext}_n(A')} E[|F^i_{t_n,x}(A)F^j_{t_n,x}(A')|] = 0. \tag{4.21}$$

Using the uniform boundedness of both $E[|F^i_{t_n,x}(A)F^j_{t_n,x}(A') - F^i_x F^j_x|]$ and 
$E[|F^i_{t_n,x}(A)F^j_{t_n,x}(A')|]$ (see (4.13) and (4.3)) we may deduce (4.14) from (4.19), and 
(4.15) from (4.21). We here elaborate only on the argument for (4.15). The absolute 
value of the sum in the left hand side of (4.15) is bounded by four terms, namely a 
sum over $x \in \mathrm{ext}_n(A)$, a sum over $x \in \mathrm{ext}_n(A')$, a sum over $x$ in a subset of $\partial_n(A)$, 
and a sum over $x$ in a subset of $\partial_n(A')$. After multiplication by $t_n^{-d}$, the first two of 
these terms tend to zero by (4.21), while the other terms tend to zero by the uniform 
boundedness of the terms in the sum and the fact that the number of sites in $\partial_n(A)$ is 
small relative to $t_n^d$ (by Riemann measurability of $A$ and of $B_0 \setminus A$), and likewise for $A'$. $\square$

Proof of Theorem 2.1. We consider linear combinations. Recall that $(t_n)_{n \geq 1}$ is 
an arbitrary sequence tending to infinity, and let $b_1, \ldots, b_k$ be arbitrary constants. 
By the Cramer- Wold device (see, e.g., |B ) it suffices to prove that with 

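$$\sigma_{j\ell} := E\big(E[\Delta^j_\infty \mid \mathcal{F}_0]\, E[\Delta^\ell_\infty \mid \mathcal{F}_0]\big), \tag{4.22}$$

we have 

$$t_n^{-d/2} \sum_{j=1}^k b_j\big(H^j_{t_n}(A_j) - E H^j_{t_n}(A_j)\big) \xrightarrow{\mathcal{D}} \mathcal{N}\Big(0, \sum_{j=1}^k \sum_{\ell=1}^k b_j b_\ell \sigma_{j\ell} |A_j \cap A_\ell|\Big), \tag{4.23}$$

and that the variance of the left hand side of (4.23) converges to that of the right 
hand side. We shall represent the left hand side of (4.23) as a sum of martingale 
differences. 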

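For $j,\ell \in \{1,2,\ldots,k\}$, let $A_{n,j,\ell} := t_n(A_j \cap A_\ell) \cap \mathbb{Z}^d$. Recall the definition of $B_n$ 
at (4.10). Let $\nu_n := \mathrm{card}(B_n)$ and $\nu_{n,j,\ell} := \mathrm{card}(A_{n,j,\ell})$. Since $B_0, A_1, A_2, \ldots, A_k$ 
are all Riemann measurable we have (for each $j, \ell$) 

$$\lim_{n\to\infty} (\nu_n / t_n^d) = |B_0|; \qquad \lim_{n\to\infty} (\nu_{n,j,\ell} / t_n^d) = |A_j \cap A_\ell|. \tag{4.24}$$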

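Define the filtration $(\mathcal{G}_0, \mathcal{G}_1, \ldots, \mathcal{G}_{\nu_n})$ as follows: let $\mathcal{G}_0$ be the trivial $\sigma$-field, label the 
elements of $B_n$ in lexicographic order as $x_1, \ldots, x_{\nu_n}$, and let $\mathcal{G}_i = \mathcal{F}_{x_i}$ for $1 \leq i \leq \nu_n$. 
Then 

$$\sum_{j=1}^k b_j\big(H^j_{t_n}(A_j) - E H^j_{t_n}(A_j)\big) = \sum_{i=1}^{\nu_n} D_i,$$

where we set $D_i := \sum_{j=1}^k b_j D_{i,j}$ with 

$$D_{i,j} := E[H^j_{t_n}(A_j) \mid \mathcal{G}_i] - E[H^j_{t_n}(A_j) \mid \mathcal{G}_{i-1}].$$

By orthogonality of martingale differences, 

$$\mathrm{Var}\Big[\sum_{j=1}^k b_j H^j_{t_n}(A_j)\Big] = \sum_{i=1}^{\nu_n} E[D_i^2].$$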
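
The martingale-difference representation and the resulting variance identity can be checked exactly on a toy example. The sketch below is an illustration only, not the paper's setting: it assumes $n$ i.i.d. fair bits on a one-dimensional lattice segment and takes the functional to be the number of maximal runs of 1s. All function names are hypothetical. Conditional expectations are computed by exhaustive enumeration with exact rational arithmetic.

```python
from fractions import Fraction
from itertools import product

def count_clusters(x):
    """Toy local functional: number of maximal runs of 1s in the 0/1 tuple x."""
    return sum(1 for i, v in enumerate(x) if v == 1 and (i == 0 or x[i - 1] == 0))

def cond_exp(h, prefix, n):
    """E[h(X) | first len(prefix) sites equal prefix], for n i.i.d. fair bits."""
    tails = list(product((0, 1), repeat=n - len(prefix)))
    return Fraction(sum(h(prefix + tail) for tail in tails), len(tails))

def martingale_differences(h, x):
    """D_i = E[h | G_i] - E[h | G_{i-1}], evaluated at the configuration x."""
    n = len(x)
    return [cond_exp(h, x[:i], n) - cond_exp(h, x[:i - 1], n)
            for i in range(1, n + 1)]

def check_variance_identity(h, n):
    """Return (Var h(X), sum_i E[D_i^2]), both computed exactly."""
    configs = list(product((0, 1), repeat=n))
    mean = Fraction(sum(h(x) for x in configs), len(configs))
    var = sum((h(x) - mean) ** 2 for x in configs) / len(configs)
    sum_sq = sum(sum(d * d for d in martingale_differences(h, x))
                 for x in configs) / len(configs)
    return var, sum_sq
```

The differences telescope to $h(X) - E\,h(X)$ for every configuration, and the two quantities returned by `check_variance_identity` coincide, which is the orthogonality statement used here.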



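By this representation of the variance, along with the central limit theorem for 
martingale difference arrays (Theorem (2.3) of McLeish [12], or Theorem 2.10 of 
Penrose [15]) it suffices to prove the conditions 

$$\sup_{n \geq 1} E\Big[\max_{1 \leq i \leq \nu_n} \big(t_n^{-d/2}|D_i|\big)^2\Big] < \infty, \tag{4.25}$$

$$t_n^{-d/2} \max_{1 \leq i \leq \nu_n} |D_i| \xrightarrow{P} 0, \tag{4.26}$$

and 

$$t_n^{-d} \sum_{i=1}^{\nu_n} D_i^2 \xrightarrow{L^1} \sum_{j=1}^k \sum_{\ell=1}^k b_j b_\ell \sigma_{j\ell} |A_j \cap A_\ell|. \tag{4.27}$$

With $\Delta^i_{t,y}(A)$ defined at (4.5), and $F^i_{t,y}(A)$ defined at (4.11), and writing $x_i$ for the 
$i$th element of $B_n$ in the lexicographic order, we have 

$$D_{i,j} = F^j_{t_n,x_i}(A_j). \tag{4.28}$$

First we check (4.25). By (4.28), we have 

$$t_n^{-d} E\Big[\max_{1 \leq i \leq \nu_n} D_i^2\Big] \leq t_n^{-d} \sum_{i=1}^{\nu_n} E[D_i^2] = t_n^{-d} \sum_{i=1}^{\nu_n} E\Big[\Big(\sum_{j=1}^k b_j F^j_{t_n,x_i}(A_j)\Big)^2\Big],$$

which is bounded, uniformly in $n$, by (4.24) and (4.13). 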

For the second condition (4.26), let $\varepsilon > 0$ and use Boole's and Markov's 
inequalities to obtain 

$$P\Big[\max_{1 \leq i \leq \nu_n} |D_i| \geq t_n^{d/2}\varepsilon\Big] \leq \sum_{i=1}^{\nu_n} \frac{E[|D_i|^\gamma]}{t_n^{d\gamma/2}\varepsilon^\gamma},$$

which tends to zero, by (4.24) and the fact that for some $\gamma > 2$, $E[|D_i|^\gamma]$ is bounded, 
uniformly over $n \geq 1$ and $i \leq \nu_n$, by (4.28) and (4.13). 

It remains to prove (4.27). It suffices to prove that for each $j, \ell \in \{1,2,\ldots,k\}$ 
we have 

$$t_n^{-d} \sum_{i=1}^{\nu_n} D_{i,j} D_{i,\ell} \xrightarrow{L^1} |A_j \cap A_\ell|\, \sigma_{j\ell}. \tag{4.29}$$

Using (4.4), (4.14), (4.15), and (4.24), we obtain 

$$t_n^{-d} \sum_{x \in B_n} F^j_{t_n,x}(A_j) F^\ell_{t_n,x}(A_\ell) \xrightarrow{L^1} |A_j \cap A_\ell|\, E[F^j_0 F^\ell_0]. \tag{4.30}$$

By the definitions (4.2) and (4.22), the right-hand side of (4.30) equals $|A_j \cap A_\ell|\, \sigma_{j\ell}$. 

By (4.28), eqn (4.30) gives us (4.29). The proof is complete. $\blacksquare$



5 Proof of general continuum results 

In this section we prove the results stated in Section 2.3. Recall the definition of a 
point process set function at the start of Section 2.3, and the definition of add one 
cost $\delta(A, \mathcal{X})$ given at (2.12). First we give some consequences of the stabilization 
and moments conditions given in that section. 

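Given $\lambda > 0$, given a point process set function $h$ and a random variable $\delta_\infty(\mathcal{P}_\lambda)$, 
let us say $h$ is weakly stabilizing at intensity $\lambda$ with stabilizing limit $\delta_\infty(\mathcal{P}_\lambda)$, if for 
any $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$, for any $A \in \mathcal{R}(B_0)$, and any $([1,\infty) \times \mathbb{R}^d)$-valued 
sequence $(t_n, x_n)_{n \geq 1}$, we have 

$$\delta(\tau_{x_n}(t_n A), \mathcal{P}_\lambda \cap \tau_{x_n}(t_n B_0)) \xrightarrow{P} \delta_\infty(\mathcal{P}_\lambda) \quad \text{if } \liminf_{n\to\infty}(\tau_{x_n}(t_n A)) = \mathbb{R}^d \tag{5.1}$$

and 

$$\delta(\tau_{x_n}(t_n A), \mathcal{P}_\lambda \cap \tau_{x_n}(t_n B_0)) \xrightarrow{P} 0 \quad \text{if } \liminf_{n\to\infty}(\tau_{x_n}(t_n(B_0 \setminus A))) = \mathbb{R}^d. \tag{5.2}$$

Weak stabilization can be viewed as a continuum version of the conditions (2.4), 
(2.5). We also consider $h$ satisfying the moments condition 

$$\sup_{A \in \mathcal{R}(B_0),\ t \geq 1,\ x \in \mathbb{R}^d :\ 0 \in \tau_x(tB_0)} \big\{E[\delta(\tau_x(tA), \mathcal{P}_\lambda \cap \tau_x(tB_0))^4]\big\} < \infty, \tag{5.3}$$

which is a Poisson point process version of the condition (2.6). 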

Lemma 5.1 Let $\lambda > 0$. Suppose $h$ is a point process set function. Then: 

(i) If $h$ is strongly stabilizing at intensity $\lambda$ with stabilizing limit $\delta_\infty(\mathcal{P}_\lambda)$ (i.e., 
satisfies (2.13) and (2.14)), then $h$ is weakly stabilizing at intensity $\lambda$ with stabilizing 
limit $\delta_\infty(\mathcal{P}_\lambda)$ (i.e., satisfies (5.1) and (5.2)). 

(ii) If $h$ satisfies conditions (2.16) and (2.17), then $h$ satisfies (5.3). 

Proof. Part (i) is obvious. Part (ii) is proved by a similar argument to the proof of 
Lemma 4.1 of ^H], which we omit, g 

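Suppose that $\lambda > 0$ and we are given a point process set function $h$ that is 
weakly stabilizing at intensity $\lambda$ with stabilizing limit $\delta_\infty(\mathcal{P}_\lambda)$. For any locally finite 
set $\mathcal{X} \subset \mathbb{R}^d$ and any $x \in \mathbb{R}^d$, define 

$$\delta_\infty(x, \mathcal{X}) := \limsup_{n\to\infty}\big(h[(\mathcal{X} \cap B_n(x)) \cup \{x\}, B_n(x)] - h[\mathcal{X} \cap B_n(x), B_n(x)]\big). \tag{5.4}$$

By the definition (2.12) of add one cost, and translation invariance (2.11), we have 

$$\delta_\infty(x, \mathcal{P}_\lambda) = \limsup_{n\to\infty} \delta(B_n(0), (\tau_{-x}(\mathcal{P}_\lambda)) \cap B_n(0)).$$

Since $\tau_{-x}(\mathcal{P}_\lambda)$ is a homogeneous Poisson process of intensity $\lambda$, by taking $A = B_0 = 
B_1(0)$, $x_n = 0$ and $t_n = n$ in (5.1) we see that $\delta_\infty(x, \mathcal{P}_\lambda)$ almost surely equals 
the stabilizing limit $\delta_\infty(\tau_{-x}(\mathcal{P}_\lambda))$ of $h$ with respect to the shifted Poisson process 
$\tau_{-x}(\mathcal{P}_\lambda)$. Thus for all $x \in \mathbb{R}^d$, we have as $n \to \infty$ that 

$$h((\mathcal{P}_\lambda \cap B_n(x)) \cup \{x\}, B_n(x)) - h(\mathcal{P}_\lambda \cap B_n(x), B_n(x)) \xrightarrow{P} \delta_\infty(x, \mathcal{P}_\lambda). \tag{5.5}$$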

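Lemma 5.2 Let $\lambda > 0$ and let the point process set function $h$ be weakly stabilizing 
at intensity $\lambda$. Given $\varepsilon \in (0,1]$, let the random vector $\xi(\varepsilon)$ be uniformly distributed 
over the cube $Q_\varepsilon$, independent of $\mathcal{P}_\lambda$. Then 

$$\delta_\infty(\xi(\varepsilon), \mathcal{P}_\lambda) \xrightarrow{P} \delta_\infty(\mathcal{P}_\lambda) \quad \text{as } \varepsilon \downarrow 0. \tag{5.6}$$

Proof. As $K \to \infty$, we have by (5.1) and (5.5) that 

$$h((\mathcal{P}_\lambda \cap B_K(0)) \cup \{0\}, B_K(0)) - h(\mathcal{P}_\lambda \cap B_K(0), B_K(0)) \xrightarrow{P} \delta_\infty(\mathcal{P}_\lambda);$$
$$h((\mathcal{P}_\lambda \cap B_K(\xi(\varepsilon))) \cup \{\xi(\varepsilon)\}, B_K(\xi(\varepsilon))) - h(\mathcal{P}_\lambda \cap B_K(\xi(\varepsilon)), B_K(\xi(\varepsilon))) \xrightarrow{P} \delta_\infty(\xi(\varepsilon), \mathcal{P}_\lambda).$$

Also, for any $K$ it is the case that $P[\mathcal{P}_\lambda \cap B_K(\xi(\varepsilon)) \neq \mathcal{P}_\lambda \cap B_K(0)] \to 0$ as $\varepsilon \downarrow 0$. 
Hence, it suffices to prove that for any integer $j \geq 1$, and any $K > 0$, if $X_1, \ldots, X_j$ 
are uniformly distributed over $B_K(0)$, independent of each other and of $\xi(\varepsilon)$, then 

$$h(\{\xi(\varepsilon), X_1, \ldots, X_j\}, B_K(\xi(\varepsilon))) \xrightarrow{P} h(\{0, X_1, \ldots, X_j\}, B_K(0)) \quad \text{as } \varepsilon \downarrow 0; \tag{5.7}$$
$$h(\{X_1, \ldots, X_j\}, B_K(\xi(\varepsilon))) \xrightarrow{P} h(\{X_1, \ldots, X_j\}, B_K(0)) \quad \text{as } \varepsilon \downarrow 0. \tag{5.8}$$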

By (2.17), the above random variables are uniformly bounded by a constant 
(dependent on $j$ and $K$). Define $\tilde{h} : (\mathbb{R}^d)^j \to \mathbb{R}$ and $h^* : (\mathbb{R}^d)^j \to \mathbb{R}$ by 

$$\tilde{h}(y_1, \ldots, y_j) := h(\{0, y_1, y_2 + y_1, y_3 + y_1, \ldots, y_j + y_1\}, B_K(0));$$
$$h^*(y_1, \ldots, y_j) := h(\{y_1, y_2 + y_1, y_3 + y_1, \ldots, y_j + y_1\}, B_K(0)).$$

If sci lies at a Lebesgue point (see e.g. [2B|) of h(-, x 2 — xi,x s — x\, . . . , Xj — x\) then 

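$$\varepsilon^{-d} \int_{Q_\varepsilon} |h(\{x, x_1, x_2, \ldots, x_j\}, B_K(x)) - h(\{0, x_1, x_2, \ldots, x_j\}, B_K(0))|\,dx$$
$$= \varepsilon^{-d} \int_{Q_\varepsilon} |h(\{0, x_1 - x, x_2 - x, \ldots, x_j - x\}, B_K(0)) - h(\{0, x_1, x_2, \ldots, x_j\}, B_K(0))|\,dx$$
$$= \varepsilon^{-d} \int_{Q_\varepsilon} |\tilde{h}(x_1 - x, x_2 - x_1, \ldots, x_j - x_1) - \tilde{h}(x_1, x_2 - x_1, \ldots, x_j - x_1)|\,dx$$
$$\to 0 \quad \text{as } \varepsilon \downarrow 0, \tag{5.9}$$

where $\tilde{h}(y_1, \ldots, y_j)$ denotes $h(\{0, y_1, y_2 + y_1, \ldots, y_j + y_1\}, B_K(0))$, and where the last 
line comes from the definition of a Lebesgue point. Similarly, if $x_1$ 
lies at a Lebesgue point of $h^*(\cdot, x_2 - x_1, x_3 - x_1, \ldots, x_j - x_1)$, then 

$$\varepsilon^{-d} \int_{Q_\varepsilon} |h(\{x_1, x_2, \ldots, x_j\}, B_K(x)) - h(\{x_1, x_2, \ldots, x_j\}, B_K(0))|\,dx \to 0 \quad \text{as } \varepsilon \downarrow 0. \tag{5.10}$$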

Since we assume h({xi, . . . , Xj}, B K (0)) is a Borel-measurable function of (xx, . . . , Xj), 
it follows that for all (y 2 , ■ ■ ■ ,yj) G (M^)-' -1 the function h({-,y 2 , ■ ■ ■ ,Vj}) is Borel- 
measurable, and hence, by the Lebesgue Density Theorem (see j22j or pE]), that 
almost every x G M. d is a Lebesgue point of h(-, y 2 , y 3 , . . . , yj). 

Suppose $X_1, \ldots, X_j$ are independent and uniformly distributed over $B_K(0)$. Then 
for almost every possible collection of values for $(X_2 - X_1, \ldots, X_j - X_1)$ the 
conditional distribution of $X_1$ conditional on these values of $(X_2 - X_1, \ldots, X_j - X_1)$ is 
absolutely continuous with respect to Lebesgue measure on $\mathbb{R}^d$ (in fact, uniform over 
a certain region). Hence, given the values of $(X_2 - X_1, \ldots, X_j - X_1)$, the conditional 
probability that $X_1$ lies at a Lebesgue point of $\tilde{h}(\cdot, X_2 - X_1, X_3 - X_1, \ldots, X_j - X_1)$ 
is 1, where $\tilde{h}(y_1, \ldots, y_j) := h(\{0, y_1, y_2 + y_1, \ldots, y_j + y_1\}, B_K(0))$. Thus, with 
probability 1, $X_1$ lies at a Lebesgue point of $\tilde{h}(\cdot, X_2 - X_1, X_3 - X_1, \ldots, X_j - X_1)$. 
Hence by (5.9), and the Dominated Convergence Theorem, 

$$E[|h(\{\xi(\varepsilon), X_1, \ldots, X_j\}, B_K(\xi(\varepsilon))) - h(\{0, X_1, \ldots, X_j\}, B_K(0))|]$$
$$= E\Big[\varepsilon^{-d} \int_{Q_\varepsilon} |h(\{x, X_1, \ldots, X_j\}, B_K(x)) - h(\{0, X_1, \ldots, X_j\}, B_K(0))|\,dx\Big]$$
$$\to 0 \quad \text{as } \varepsilon \downarrow 0.$$

Since convergence in $L^1$ implies convergence in probability, (5.7) then follows. Also, 
by a similar argument to the above, $X_1$ lies almost surely at a Lebesgue point of 
$h^*(\cdot, X_2 - X_1, X_3 - X_1, \ldots, X_j - X_1)$ so that using (5.10) we obtain convergence in 
$L^1$ of $h(\{X_1, \ldots, X_j\}, B_K(\xi(\varepsilon)))$ to $h(\{X_1, \ldots, X_j\}, B_K(0))$, to obtain (5.8). $\blacksquare$

Next we use discretization and application of Theorem 2.1 to prove a weaker 
statement of Theorem 2.2 which does not include the expression (2.18) for $\sigma^\lambda_{ij}$. In 
the proof we introduce a parameter $\varepsilon$ which we shall later on make tend to zero to 
establish (2.18). 

Proposition 5.1 Let $\lambda > 0$ and let $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. Suppose that 
$h^1, \ldots, h^k$ are point process set functions which satisfy the weak stabilization 
conditions (5.1), (5.2), and the moments condition (5.3). Then there exists a $k \times k$ 
matrix $\Sigma^\lambda = (\sigma^\lambda_{ij})_{i,j=1}^k$ such that if $A_1, \ldots, A_k$ are sets in $\mathcal{R}(B_0)$, then as $t \to \infty$, 

$$t^{-d}\, \mathrm{Cov}\big(h^i(\mathcal{P}_{\lambda,t}, tA_i), h^j(\mathcal{P}_{\lambda,t}, tA_j)\big) \to \lambda \sigma^\lambda_{ij} |A_i \cap A_j| \tag{5.11}$$

and 

$$t^{-d/2}\big(h^i(\mathcal{P}_{\lambda,t}, tA_i) - E h^i(\mathcal{P}_{\lambda,t}, tA_i)\big)_{i=1}^k \xrightarrow{\mathcal{D}} \mathcal{N}\big(0, (\lambda \sigma^\lambda_{ij} |A_i \cap A_j|)_{i,j=1}^k\big). \tag{5.12}$$

Proof. Fix $\varepsilon \in (0, 1]$. To apply Theorem 2.1, for $z \in \mathbb{Z}^d$ define $X_z$ to be the 
point process $\tau_{-\varepsilon z}(\mathcal{P}_\lambda \cap Q^\varepsilon_z)$. Then $X_z$ ($z \in \mathbb{Z}^d$) are independent and identically 
distributed (they are independent Poisson processes on $Q^\varepsilon_0$ of intensity $\lambda$). Also, 
define the random set function 

Hl(A) := h l (V x ,u teA), t>l,Ae H^Bq), (5.13) 

which is a function of (X z , z G (t/e)B ); here we denote this function by g((X z , z G 

(t/4§ )). Set := tf^X, A), as at (Q. Then iZ^ = g((X y+ZJ z G (f/^Bo)), 
and hence by the translation invariance property (|2.11j) of /i l we have 

H{ y {A) = h\r. ey {V x )ntB ,teA) = h\V x n T £y (tB ),T £y (teA)). (5.14) 

[For example, if h l {X;A) is simply the number of points of X in A, then (using 

the definition of X z above) g((X z , z G (t/e)B )) equals J2 Z carc KT £2 (A 2 ) HteA), and 
hence, 

g((X y+z ,ze (t/e)B )) = ^ cavd(r EZ (X y+z ) n teA) 

z 

= card(r_ £y (P A n Q £ z+y ) fl teA) = card(r_ £2/ (P A ) n teA), 

which is consistent with (15.14)) .] 
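
The discretization step in this proof, which cuts the point configuration into per-cube pieces and shifts each piece back to a reference cube, can be sketched in Python as follows. This is an illustration under an explicit assumption: the cube $Q^\varepsilon_z$ is taken to be $\varepsilon z + [0,\varepsilon)^d$, which may differ from the convention fixed in Section 2.1. The function names `discretize` and `reassemble` are hypothetical.

```python
import math

def discretize(points, eps):
    """Split a finite point set into per-cube configurations.

    Assumes (for this sketch) that the cube Q_z is eps*z + [0, eps)^d.
    Returns {z: [points of Q_z shifted by -eps*z]}, so every stored
    point lies in the reference cube at the origin.
    """
    cells = {}
    for p in points:
        z = tuple(math.floor(c / eps) for c in p)
        shifted = tuple(c - eps * zc for c, zc in zip(p, z))
        cells.setdefault(z, []).append(shifted)
    return cells

def reassemble(cells, eps):
    """Inverse of discretize: shift each cube's points back by +eps*z."""
    return [tuple(c + eps * zc for c, zc in zip(q, z))
            for z, qs in cells.items() for q in qs]
```

Because the cubes partition the space, any functional of the original configuration can be rewritten as a function of the family of per-cube configurations, as in the point-counting example in the bracketed remark above.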

We need to check conditions (2.4), (2.5), and (2.6) in this context. These refer 
to the increment 

$$\Delta^i_{t,y}(A) = H^i_{t,y}(X,A) - H^i_{t,y}(X^0,A)$$

which, by ()5.14jl . is (minus) the increment in h l (V\ fl r ey (tB ), r ey {teA)) when we 
resample the Poisson process V\ in the cube Q £ . The stabilization condition (|5.1|) 
refers instead to the insertion of a single point at the origin; however, the required 
stabilization (|2.4|) (in the present Poissonian context) can be deduced from ()5.1)) by 
the argument used to prove (3.2) of ^B]- Moreover, a virtually identical argument 
can be used to deduce ()2.5|) from ()5.2|) . 

The proof of ()2.6)) . in this context, from the assumed condition (|5.3|) . proceeds 
essentially by the argument given to prove (3.3) of jTHj; because of this proximity 
we do not give further details. Having established conditions ()2.4|) . ()2.5)) and f|2.6|) 
we may apply Theorem 12.11 to deduce the results ()5.11jl and ()5.12j) (see (|5.1fij) and 
(fQ7j) below). | 

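The proof of Proposition 5.1 just given actually provides us with some information 
about the limiting variance matrix $(\sigma^\lambda_{ij})_{i,j=1}^k$. In the context of this proof, the 
$\sigma$-field $\mathcal{F}_0$ appearing in Theorem 2.1 is, in effect, the $\sigma$-field generated by the 
restriction of the Poisson configuration $\mathcal{P}_\lambda$ to $\cup_{z \in \mathbb{Z}^d : z \preceq 0}\, Q^\varepsilon_z$, i.e., to cubes in the division 
of $\mathbb{R}^d$ into cubes $Q^\varepsilon_z$ of side $\varepsilon$, up to and including $Q^\varepsilon_0$ in the lexicographic ordering. 
To emphasize its dependence on $\varepsilon$, we denote this $\sigma$-field by $\mathcal{F}^\varepsilon_0$. With the random 
set function $H^i_t$ defined by (5.13), define $\sigma^*_{ij}(\varepsilon)$ (which also depends on $\lambda$) by 

$$\sigma^*_{ij}(\varepsilon) := E[E(\Delta^i_\infty \mid \mathcal{F}^\varepsilon_0)\, E(\Delta^j_\infty \mid \mathcal{F}^\varepsilon_0)]. \tag{5.15}$$

Then the application of Theorem 2.1 in the preceding proof gives us 

$$\lim_{t\to\infty} t^{-d}\, \mathrm{Cov}\big(h^i(\mathcal{P}_{\lambda,t}, tA_i), h^j(\mathcal{P}_{\lambda,t}, tA_j)\big) = \lim_{t\to\infty} t^{-d}\, \mathrm{Cov}\big(H^i_{t/\varepsilon}(A_i), H^j_{t/\varepsilon}(A_j)\big)$$
$$= \varepsilon^{-d} |A_i \cap A_j|\, \sigma^*_{ij}(\varepsilon) \tag{5.16}$$

and as $t \to \infty$, 

$$\big(t^{-d/2}(h^i(\mathcal{P}_{\lambda,t}, tA_i) - E h^i(\mathcal{P}_{\lambda,t}, tA_i))\big)_{i=1}^k \xrightarrow{\mathcal{D}} \mathcal{N}\big(0, (\varepsilon^{-d} \sigma^*_{ij}(\varepsilon) |A_i \cap A_j|)_{i,j=1}^k\big). \tag{5.17}$$

In other words, the matrix $(\sigma^\lambda_{ij})_{i,j=1}^k$ in the statement of Proposition 5.1 is given, 
for any $\varepsilon \in (0, 1]$, by 

$$\sigma^\lambda_{ij} = \lambda^{-1} \varepsilon^{-d} \sigma^*_{ij}(\varepsilon). \tag{5.18}$$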

Proof of Theorem 2.2. In view of Lemma 5.1, Proposition 5.1 and the discussion 
above, it remains to prove that if $h^1, \ldots, h^k$ are weakly stabilizing at intensity $\lambda$, 
and satisfy the moments condition (5.3), then $\sigma^\lambda_{ij}$, given by (5.18) for any $\varepsilon \in (0,1]$, 
is also given by (2.18). 

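With $i$ and $j$ fixed, define point process set functions $h := h^i + h^j$ and $h' := h^i - h^j$, 
along with the corresponding random set functions $H := H^i + H^j$ and $H' := H^i - H^j$ 
(where $H^i$ and $H^j$ are given at (5.13)). The definition of $H^i$ also depends on $\varepsilon$, as 
does the limiting increment $\Delta^i_\infty$; from now on we denote the latter quantity by 
$\Delta^{i,\varepsilon}_\infty$, and define $\Delta^{H,\varepsilon}_\infty$ and $\Delta^{H',\varepsilon}_\infty$ analogously. By linearity, for all $\varepsilon > 0$ we have 

$$\sigma^*_{ij}(\varepsilon) = (1/4)\, E\big[(E[\Delta^{H,\varepsilon}_\infty \mid \mathcal{F}^\varepsilon_0])^2 - (E[\Delta^{H',\varepsilon}_\infty \mid \mathcal{F}^\varepsilon_0])^2\big].$$

To prove (2.18), we use the fact that the value of $\varepsilon^{-d} \sigma^*_{ij}(\varepsilon)$ does not depend on the 
choice of $\varepsilon$, since the left hand side of (5.16) does not depend on $\varepsilon$ and therefore 
neither does the right hand side. The aim is to show, by taking $\varepsilon \downarrow 0$ in (5.18), that 
$\sigma^\lambda_{ij}$ equals the expression 

$$E[E(\delta^i_\infty(\mathcal{P}_\lambda) \mid \mathcal{F})\, E(\delta^j_\infty(\mathcal{P}_\lambda) \mid \mathcal{F})] = \frac{1}{4} E\big[(E[\delta_\infty(\mathcal{P}_\lambda) \mid \mathcal{F}])^2 - (E[\delta'_\infty(\mathcal{P}_\lambda) \mid \mathcal{F}])^2\big],$$

where $\delta_\infty(\mathcal{P}_\lambda)$ (respectively $\delta'_\infty(\mathcal{P}_\lambda)$) is the stabilizing limit of the point process set 
function $h^i + h^j$ (respectively $h^i - h^j$). In other words, it remains to prove that 

$$\lim_{\varepsilon \downarrow 0} \varepsilon^{-d} E[(E[\Delta^{H,\varepsilon}_\infty \mid \mathcal{F}^\varepsilon_0])^2] = \lambda E[(E[\delta_\infty(\mathcal{P}_\lambda) \mid \mathcal{F}])^2], \tag{5.19}$$

and also a similar limit for $H'$, for which the proof will be identical. 

By following the proof of ([18], Lemma 3.1) and observing that $c(\mu)$ in that proof 
tends to zero as $\mu \downarrow 0$, we see that 

$$\lim_{\varepsilon \downarrow 0} E[(\Delta^{H,\varepsilon}_\infty)^4] = 0. \tag{5.20}$$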

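Let $N_\varepsilon$ (respectively $N'_\varepsilon$) be the number of points of $\mathcal{P}_\lambda$ in the cube $Q_\varepsilon$ (respectively 
the number of resampled Poisson points in $Q_\varepsilon$). If $N_\varepsilon = N'_\varepsilon = 0$ then $\Delta^{H,\varepsilon}_\infty = 0$. 
Also, $N_\varepsilon$ is $\mathcal{F}^\varepsilon_0$-measurable. Hence, 

$$E[\Delta^{H,\varepsilon}_\infty \mathbf{1}_{\{N'_\varepsilon = 0\}} \mid \mathcal{F}^\varepsilon_0]\, \mathbf{1}_{\{N_\varepsilon = 0\}} = E[\Delta^{H,\varepsilon}_\infty \mathbf{1}_{\{N'_\varepsilon = 0\}} \mathbf{1}_{\{N_\varepsilon = 0\}} \mid \mathcal{F}^\varepsilon_0] = E[0 \mid \mathcal{F}^\varepsilon_0] = 0, \quad \text{a.s.}$$

Hence 

$$(E[\Delta^{H,\varepsilon}_\infty \mid \mathcal{F}^\varepsilon_0]\, \mathbf{1}_{\{N_\varepsilon = 0\}})^2 = (E[\Delta^{H,\varepsilon}_\infty (\mathbf{1}_{\{N'_\varepsilon = 0\}} + \mathbf{1}_{\{N'_\varepsilon > 0\}}) \mid \mathcal{F}^\varepsilon_0]\, \mathbf{1}_{\{N_\varepsilon = 0\}})^2$$
$$= (E[\Delta^{H,\varepsilon}_\infty \mathbf{1}_{\{N'_\varepsilon > 0\}} \mid \mathcal{F}^\varepsilon_0]\, \mathbf{1}_{\{N_\varepsilon = 0\}})^2 \leq (E[\Delta^{H,\varepsilon}_\infty \mathbf{1}_{\{N'_\varepsilon > 0\}} \mid \mathcal{F}^\varepsilon_0])^2, \quad \text{a.s.}$$

Hence, by the conditional Cauchy-Schwarz inequality (see e.g. [5]), and the 
independence of $N'_\varepsilon$ and $\mathcal{F}^\varepsilon_0$, 

$$(E[\Delta^{H,\varepsilon}_\infty \mid \mathcal{F}^\varepsilon_0]\, \mathbf{1}_{\{N_\varepsilon = 0\}})^2 \leq E[(\Delta^{H,\varepsilon}_\infty)^2 \mid \mathcal{F}^\varepsilon_0]\, P[N'_\varepsilon > 0], \quad \text{a.s.}$$

Taking expectations, then using Jensen's inequality and (5.20), we obtain 

$$\varepsilon^{-d} E[(E[\Delta^{H,\varepsilon}_\infty \mid \mathcal{F}^\varepsilon_0])^2 \mathbf{1}_{\{N_\varepsilon = 0\}}] \leq \varepsilon^{-d} P[N'_\varepsilon > 0]\, E[(\Delta^{H,\varepsilon}_\infty)^2] \leq \lambda E[(\Delta^{H,\varepsilon}_\infty)^4]^{1/2} \to 0 \quad \text{as } \varepsilon \downarrow 0. \tag{5.21}$$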

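Let $Y_\varepsilon = (E(\Delta^{H,\varepsilon}_\infty \mid \mathcal{F}^\varepsilon_0))^2$. By the Cauchy-Schwarz and Jensen inequalities, and 
(5.20), 

$$\varepsilon^{-d} E[Y_\varepsilon \mathbf{1}_{\{N_\varepsilon \geq 2\}}] \leq \varepsilon^{-d} (P[N_\varepsilon \geq 2])^{1/2} (E[Y_\varepsilon^2])^{1/2} \leq \text{const.} \times E[(\Delta^{H,\varepsilon}_\infty)^4]^{1/2} \to 0 \quad \text{as } \varepsilon \downarrow 0. \tag{5.22}$$

Similarly, 

$$\varepsilon^{-d} E[Y_\varepsilon \mathbf{1}_{\{N_\varepsilon = 1,\, N'_\varepsilon \geq 1\}}] \to 0 \quad \text{as } \varepsilon \downarrow 0.$$

It remains to consider $E[Y_\varepsilon \mathbf{1}_{\{N_\varepsilon = 1,\, N'_\varepsilon = 0\}}]$. Since $P[N_\varepsilon = 1, N'_\varepsilon = 0] \sim \lambda \varepsilon^d$ as $\varepsilon \downarrow 0$, 
to establish (5.19) we must show that 

$$\lim_{\varepsilon \downarrow 0} E[(E[\Delta^{H,\varepsilon}_\infty \mid \mathcal{F}^\varepsilon_0])^2 \mid N_\varepsilon = 1, N'_\varepsilon = 0] = E[(E[\delta_\infty(\mathcal{P}_\lambda) \mid \mathcal{F}])^2]. \tag{5.23}$$

Given $\varepsilon$, let $\mathcal{Y}_\varepsilon$ (respectively $\mathcal{Y}'_\varepsilon$) be the restriction of the Poisson process $\mathcal{P}_\lambda$ to the 
union of cubes $Q^\varepsilon_z$ with $z \in \mathbb{Z}^d$ and $z \prec 0$ (respectively, with $0 \prec z$). 
Given that $N_\varepsilon = 1$ and $N'_\varepsilon = 0$, the restriction of $\mathcal{P}_\lambda$ to $Q^\varepsilon_0$ consists of a single 
point uniformly distributed over $Q^\varepsilon_0$ and independent of $(\mathcal{Y}_\varepsilon, \mathcal{Y}'_\varepsilon)$; we denote this 
random point by $\xi'(\varepsilon)$. Then, given that $N_\varepsilon = 1$ and $N'_\varepsilon = 0$, almost surely $\Delta^{H,\varepsilon}_\infty$ 
equals the increment $\delta_\infty(\xi'(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon)$ (using notation defined at (5.4)). Thus, 

$$E[(E[\Delta^{H,\varepsilon}_\infty \mid \mathcal{F}^\varepsilon_0])^2 \mid N_\varepsilon = 1, N'_\varepsilon = 0] = E[(E[\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon])^2] \tag{5.24}$$

where, as in Lemma 5.2, $\xi(\varepsilon)$ is uniformly distributed over $Q_\varepsilon$ and is independent 
of $\mathcal{P}_\lambda$. 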

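By the Cauchy-Schwarz and Jensen inequalities, 

$$E\{(E[\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon])^2 - (E[\delta_\infty(\mathcal{P}_\lambda) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon])^2\}$$
$$= E\{E[\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) + \delta_\infty(\mathcal{P}_\lambda) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon] \times E[\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) - \delta_\infty(\mathcal{P}_\lambda) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon]\}$$
$$\leq E[(\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) + \delta_\infty(\mathcal{P}_\lambda))^2]^{1/2}\, E[(\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) - \delta_\infty(\mathcal{P}_\lambda))^2]^{1/2}. \tag{5.25}$$

By the moments condition (5.3), the stabilization condition (5.1), and Fatou's 
lemma, $E[\delta_\infty(\mathcal{P}_\lambda)^4] < \infty$. Also, by definition $\delta_\infty(\mathcal{P}_\lambda)$ is almost surely the same as 
$\delta_\infty(0, \mathcal{P}_\lambda)$ which has the same distribution as $\delta_\infty(\xi(\varepsilon), \mathcal{P}_\lambda)$ by translation-invariance, 
so that 

$$E[\delta_\infty(\mathcal{P}_\lambda)^4] = E[\delta_\infty(\xi(\varepsilon), \mathcal{P}_\lambda)^4] \geq P[\mathcal{P}_\lambda \cap Q_\varepsilon = \emptyset]\, E[\delta_\infty(\xi(\varepsilon), \mathcal{P}_\lambda)^4 \mid \mathcal{P}_\lambda \cap Q_\varepsilon = \emptyset]$$
$$= e^{-\lambda \varepsilon^d} E[\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon)^4],$$

so that $E[\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon)^4]$ remains bounded as $\varepsilon \downarrow 0$. Combining all these 
estimates, we obtain 

$$\limsup_{\varepsilon \downarrow 0} E[(\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) + \delta_\infty(\mathcal{P}_\lambda))^2] < \infty. \tag{5.26}$$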

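As $\varepsilon \downarrow 0$, it is the case that $P[\mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon \neq \mathcal{P}_\lambda]$ tends to zero, and hence 

$$\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) - \delta_\infty(\xi(\varepsilon), \mathcal{P}_\lambda) \xrightarrow{P} 0.$$

Combined with (5.6) from Lemma 5.2, this implies that $\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) \xrightarrow{P} 
\delta_\infty(\mathcal{P}_\lambda)$, and hence, using also the fact that $\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon)$ has uniformly bounded 
fourth moments, we obtain the limit 

$$E[(\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) - \delta_\infty(\mathcal{P}_\lambda))^2] \to 0 \quad \text{as } \varepsilon \downarrow 0.$$

Combined with (5.25) and (5.26), this shows that 

$$\lim_{\varepsilon \downarrow 0} \big(E\{(E[\delta_\infty(\xi(\varepsilon), \mathcal{Y}_\varepsilon \cup \mathcal{Y}'_\varepsilon) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon])^2 - (E[\delta_\infty(\mathcal{P}_\lambda) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon])^2\}\big) = 0. \tag{5.27}$$

If we denote by $V_\varepsilon$ the union of the cubes $Q^\varepsilon_z$, $z \prec 0$, then by the definition of $Q^\varepsilon_z$ 
in Section 2.1, we find that $V_\varepsilon \subset V_{\varepsilon'}$ for $0 < \varepsilon' < \varepsilon$, and also $\cup_{\varepsilon > 0} V_\varepsilon$ is the 
half-space $\{(x_1, \ldots, x_d) \in \mathbb{R}^d : x_1 < 0\}$. Hence the $\sigma$-field generated by $\mathcal{Y}_\varepsilon$ increases as $\varepsilon$ 
decreases, and the smallest $\sigma$-field with respect to which all $\mathcal{Y}_\varepsilon$, $\varepsilon > 0$, are measurable 
is the $\sigma$-field $\mathcal{F}$ generated by the Poisson configuration in the aforementioned 
half-space (which is the same as $\mathcal{F}$ given in the statement of Theorem 2.2). 

By the independence of $\xi(\varepsilon)$ from $\delta_\infty(\mathcal{P}_\lambda)$ and $\mathcal{Y}_\varepsilon$, along with the Martingale 
Convergence Theorem, as $\varepsilon \downarrow 0$, 

$$E[\delta_\infty(\mathcal{P}_\lambda) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon] = E[\delta_\infty(\mathcal{P}_\lambda) \mid \mathcal{Y}_\varepsilon] \to E[\delta_\infty(\mathcal{P}_\lambda) \mid \mathcal{F}] \quad \text{a.s.}$$

Since $E[\delta_\infty(\mathcal{P}_\lambda)^4] < \infty$, the variables $(E[\delta_\infty(\mathcal{P}_\lambda) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon])^2$ are uniformly integrable, 
so that 

$$E[(E[\delta_\infty(\mathcal{P}_\lambda) \mid \xi(\varepsilon), \mathcal{Y}_\varepsilon])^2] \to E[(E[\delta_\infty(\mathcal{P}_\lambda) \mid \mathcal{F}])^2] \quad \text{as } \varepsilon \downarrow 0.$$

Combining this with (5.27) and (5.24), we obtain (5.23) as required. $\square$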

To de-Poissonize the limits flTKfy , (l2~27H) and obtain flOS} and (l2~2ll) . we use 
a coupling technique related to that used in jH] and (THj. Let -B 6 72.(M d ) with 
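$|B_0| > 0$, and let $A \in \mathcal{R}(B_0)$. Let $U_{1,t}, U_{2,t}, U_{3,t}, \ldots$ be independent and uniformly 
distributed over $tB_0$; we assume that the point processes $\mathcal{U}_{1,t}, \mathcal{U}_{2,t}, \mathcal{U}_{3,t}, \ldots$ are coupled 
by setting 

$$\mathcal{U}_{m,t} = \{U_{1,t}, \ldots, U_{m,t}\}, \quad m \in \mathbb{N}. \tag{5.28}$$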

With this coupling, given point process set functions $h, h'$, we make the definition 

$$R_{m,t}(A) := h(\mathcal{U}_{m+1,t}, tA) - h(\mathcal{U}_{m,t}, tA), \tag{5.29}$$
$$R'_{m,t}(A) := h'(\mathcal{U}_{m+1,t}, tA) - h'(\mathcal{U}_{m,t}, tA). \tag{5.30}$$
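
The coupled binomial point sets and their one-point increments can be simulated directly. The Python sketch below is illustrative only: it takes $tB_0$ to be the square $[0,t]^2$, and uses a hypothetical toy functional `isolated_count` (the number of points in a box with no other point within a fixed radius) in place of a general point process set function. The names `isolated_count` and `coupled_increments` are not from the paper.

```python
import math
import random

def isolated_count(points, region, radius=1.0):
    """Toy functional h(X, A): points of X in the box A with no other
    point of X within distance `radius`."""
    def in_region(p):
        return all(lo <= c <= hi for c, (lo, hi) in zip(p, region))
    return sum(
        1 for i, p in enumerate(points)
        if in_region(p)
        and all(math.dist(p, q) > radius for j, q in enumerate(points) if j != i)
    )

def coupled_increments(h, t, region, m_max, seed=0, dim=2):
    """Sample U_1, ..., U_{m_max+1} uniformly on [0, t]^dim and return
    (increments, points), where increments[m] = h(first m+1 points)
    - h(first m points) for m = 0, ..., m_max."""
    rng = random.Random(seed)
    pts = [tuple(rng.uniform(0, t) for _ in range(dim)) for _ in range(m_max + 1)]
    values = [h(pts[:m], region) for m in range(m_max + 2)]
    return [values[m + 1] - values[m] for m in range(m_max + 1)], pts
```

Because the point sets are nested, the increments telescope: the difference of the functional between the first $m$ and the first $n$ points is the sum of the increments with indices from $n$ to $m-1$. This is the form in which the increments are used when comparing Poisson and binomial samples.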

Let $\lambda > 0$, and recall from (2.15) the definition 

$$\mu_{\lambda,t} := \lambda t^d |B_0|.$$

We shall use the following coupling lemma, which resembles Lemma 4.2 of [18]. 

Lemma 5.3 Suppose $h$ is a point process set function which is strongly stabilizing at 
intensity $\lambda$ (i.e., satisfies (2.13) and (2.14)) with stabilizing limit $\delta_\infty(\mathcal{P}_\lambda)$. Suppose 
$h'$ is a point process set function which is also strongly stabilizing at intensity $\lambda$ with 
stabilizing limit $\delta'_\infty(\mathcal{P}_\lambda)$. Let the random $d$-vector $Y$ be uniformly distributed over 
$B_0$ and independent of $\mathcal{P}_\lambda$. Let $\varepsilon > 0$. Then there exist $\eta > 0$ and $t_0 \geq 1$ such that 
for all $t \geq t_0$ and all integers $m, m' \in [(1 - \eta)\mu_{\lambda,t}, (1 + \eta)\mu_{\lambda,t}]$ with $m < m'$, there exists 
a coupled family of variables $D, D', R, R'$ with the following properties: 

• $D$ has the same distribution as $\delta_\infty(\mathcal{P}_\lambda)\mathbf{1}_{\{Y \in A\}}$; 

• $D'$ has the same distribution as $\delta'_\infty(\mathcal{P}_\lambda)\mathbf{1}_{\{Y \in A\}}$; 

• $D$ and $D'$ are independent; 

• $(R, R')$ have the same joint distribution as $(R_{m,t}(A), R'_{m',t}(A))$; 

• $P[\{D \neq R\} \cup \{D' \neq R'\}] < \varepsilon$. 

Proof. Suppose we are given $t$. On a suitable probability space, let $\mathcal{P}$ and $\mathcal{P}'$ be 
independent homogeneous Poisson point processes in $\mathbb{R}^d$ of intensity $\lambda$; let $U, U', V_1, V_2, \ldots$ 
be independent variables uniformly distributed over $tB_0$, independent of $\mathcal{P}$ and $\mathcal{P}'$. 

Let $\mathcal{P}''$ be the point process consisting of those points of $\mathcal{P}$ which lie closer to 
$U$ than to $U'$ (in the Euclidean norm), together with those points of $\mathcal{P}'$ which lie 
closer to $U'$ than to $U$. Then $\mathcal{P}''$ is a homogeneous Poisson process of intensity $\lambda$ on 
$\mathbb{R}^d$, and moreover it is independent of $U$ and of $U'$. 
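
The construction of the merged process from two independent Poisson samples can be sketched as follows. This Python illustration makes simplifying assumptions that are not in the paper: the processes are sampled only on a bounded square window rather than on all of $\mathbb{R}^d$, and the Poisson count is generated by Knuth's multiplication method, which is adequate only for moderate means. The function names are hypothetical.

```python
import math
import random

def poisson_sample(rng, mean):
    """Poisson(mean) variate via Knuth's multiplication method."""
    threshold, k, prod = math.exp(-mean), 0, rng.random()
    while prod > threshold:
        k += 1
        prod *= rng.random()
    return k

def poisson_points(rng, intensity, side, dim=2):
    """Homogeneous Poisson process of the given intensity on [0, side]^dim."""
    n = poisson_sample(rng, intensity * side ** dim)
    return [tuple(rng.uniform(0, side) for _ in range(dim)) for _ in range(n)]

def merge_by_nearest(p, p_prime, u, u_prime):
    """Keep points of p closer to u than to u_prime, together with
    points of p_prime closer to u_prime than to u."""
    kept = [x for x in p if math.dist(x, u) < math.dist(x, u_prime)]
    kept_prime = [x for x in p_prime if math.dist(x, u_prime) < math.dist(x, u)]
    return kept + kept_prime
```

Each half-space determined by the perpendicular bisector of $U$ and $U'$ receives its points from exactly one of the two processes, so, conditionally on $U$ and $U'$, the merged set is again Poisson of the same intensity; since this conditional law does not depend on $U$ and $U'$, the merged process is independent of them.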

Let $N$ denote the number of points of $\mathcal{P}''$ lying in $tB_0$ (a Poisson variable with 
mean $\mu_{\lambda,t}$). Choose an ordering on the points of $\mathcal{P}''$ lying in $tB_0$, uniformly at random 
from all $N!$ possible such orderings. Use this ordering to list the points of $\mathcal{P}''$ in 
$tB_0$ as $W_1, W_2, \ldots, W_N$. Also, set $W_{N+1} = V_1$, $W_{N+2} = V_2$, $W_{N+3} = V_3$ and so on. 
Define the point process $\mathcal{W}_n := \{W_1, \ldots, W_n\}$ (for each $n \geq 1$), and the increments 

$$R := h(\mathcal{W}_m \cup \{U\}, tA) - h(\mathcal{W}_m, tA);$$

$$R' := h'(\mathcal{W}_{m'-1} \cup \{U, U'\}, tA) - h'(\mathcal{W}_{m'-1} \cup \{U\}, tA).$$

The variables $U, U', W_1, W_2, W_3, \ldots$ are independent uniformly distributed variables 
on $tB_0$, and therefore the pairs $(R, R')$ and $(R_{m,t}(A), R'_{m',t}(A))$ have the same joint 
distribution as claimed. 

Let $\tilde{\mathcal{P}}$ be the translated point process $\tau_{-U}(\mathcal{P})$. Similarly, let $\tilde{\mathcal{P}}' := \tau_{-U'}(\mathcal{P}')$. 
Then $\tilde{\mathcal{P}}$ and $\tilde{\mathcal{P}}'$ are independent homogeneous Poisson processes of intensity $\lambda$ on 
$\mathbb{R}^d$. Moreover, $U$ and $U'$ are independent of $\tilde{\mathcal{P}}$ and $\tilde{\mathcal{P}}'$. Let $S$ be a radius of 
stabilization of $h$ with respect to $\tilde{\mathcal{P}}$, and let $S'$ be a radius of stabilization of $h'$ with 
respect to $\tilde{\mathcal{P}}'$. Recall the definition of the add one cost $\delta(A, \mathcal{X})$ at (2.12), and define 

$$D := \delta(B_S(0), \tilde{\mathcal{P}} \cap B_S(0))\mathbf{1}_{\{U \in tA\}}; \qquad D' := \delta'(B_{S'}(0), \tilde{\mathcal{P}}' \cap B_{S'}(0))\mathbf{1}_{\{U' \in tA\}}.$$

Then $D$ and $D'$ are independent, and $D$ has the same distribution as $\delta_\infty(\mathcal{P}_\lambda)\mathbf{1}_{\{Y \in A\}}$, 
while $D'$ has the same distribution as $\delta'_\infty(\mathcal{P}_\lambda)\mathbf{1}_{\{Y \in A\}}$. 

It remains to show that $(D, D') = (R, R')$ with high probability. Choose $K$ such 
that $P[S > K] < \varepsilon/9$ and $P[S' > K] < \varepsilon/9$. Using the assumption that $B_0$ and 
$A$ are Riemann measurable, take $t$ to be so large that except on an event (denoted 
$E_0$) of probability less than $\varepsilon/9$, the positions of $U$ and $U'$ are Euclidean distance 
at least $2K$ from $\partial(tB_0)$, from $\partial(tA)$, and from each other. Set $\eta = \varepsilon(2K)^{-d}/(18\lambda)$. 
We assume $|m - \mu_{\lambda,t}| \leq \eta\mu_{\lambda,t}$ and $|m' - \mu_{\lambda,t}| \leq \eta\mu_{\lambda,t}$. Define events $E_1, E, E'$ by 

$$E_1 := \{|N - m| > 2\eta\mu_{\lambda,t}\} \cup \{|N - m'| > 2\eta\mu_{\lambda,t}\},$$

$$E := \{\mathcal{W}_m \cap B_K(U) \neq \mathcal{P}'' \cap tB_0 \cap B_K(U)\},$$

$$E' := \{\mathcal{W}_{m'} \cap B_K(U') \neq \mathcal{P}'' \cap tB_0 \cap B_K(U')\}.$$

Event $E$ occurs either if one or more of the $(N - m)^+$ "discarded" points of $\mathcal{P}''$ lies 
in $B_K(U)$, or if one or more of the $(m - N)^+$ "added" points of $\{V_1, V_2, \ldots\}$ lies in 
$B_K(U)$, and similarly for $E'$. Hence, 

$$P[E \mid E_1^c] \leq (2\eta\mu_{\lambda,t})(2K)^d(\lambda/\mu_{\lambda,t}) \leq \varepsilon/9; \qquad P[E' \mid E_1^c] \leq \varepsilon/9.$$

Using the defining properties (2.13) and (2.14) of the radii of (strong) stabilization 
$S, S'$ for $\tilde{\mathcal{P}}$ and $\tilde{\mathcal{P}}'$, and using Boole's inequality, we obtain for large enough $t$ 
that 

$$P[(D, D') \neq (R, R')] \leq P[E_0] + P[E_1] + P[S > K] + P[S' > K] + P[E \setminus E_1] + P[E' \setminus E_1] < \varepsilon. \qquad \square$$

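Lemma 5.4 Let $\lambda > 0$ and let $B_0 \in \mathcal{R}(\mathbb{R}^d)$ with $|B_0| > 0$. Suppose that $h$ and 
$h'$ are point process set functions which are strongly stabilizing at intensity $\lambda$, with 
stabilizing limit $\delta_\infty(\mathcal{P}_\lambda)$, $\delta'_\infty(\mathcal{P}_\lambda)$ respectively, and $h$ and $h'$ both satisfy the moments 
condition (2.16). Suppose $A \in \mathcal{R}(B_0)$. Let $g : [1,\infty) \to (0, \infty)$ be a function with 
$g(t)/t^d \to 0$ as $t \to \infty$. Then $R_{m,t}$ and $R'_{m,t}$ defined at (5.29), (5.30) satisfy 

$$\lim_{t\to\infty} \sup_{\mu_{\lambda,t} - g(t) \leq m \leq \mu_{\lambda,t} + g(t)} \Big| E R_{m,t}(A) - \frac{|A|}{|B_0|} E\delta_\infty(\mathcal{P}_\lambda) \Big| = 0. \tag{5.31}$$

Also, 

$$\lim_{t\to\infty} \sup_{\mu_{\lambda,t} - g(t) \leq m < m' \leq \mu_{\lambda,t} + g(t)} \Big| E R_{m,t}(A) R'_{m',t}(A) - \Big(\frac{|A|}{|B_0|}\Big)^2 (E\delta_\infty(\mathcal{P}_\lambda))\, E\delta'_\infty(\mathcal{P}_\lambda) \Big| = 0, \tag{5.32}$$

and 

$$\limsup_{t\to\infty} \sup_{\mu_{\lambda,t} - g(t) \leq m \leq \mu_{\lambda,t} + g(t)} \max\big(E[R_{m,t}(A)^4], E[R'_{m,t}(A)^4]\big) < \infty. \tag{5.33}$$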

Proof. We start with f)5.33j) : this follows from the moments condition (|2.16|) . 

Suppose {t{n),n > 1) is an arbitrary (0, oo)-valued sequence tending to infinity 
as n — > oo. Suppose (m(n), n > 1) and (m'(n),n > 1) are N- valued sequences which 
satisfy 

A*A,t(n) - < m(n) < m'(n) < A*A,t(n) + g(t( n ))- (5.34) 

By Lemma IS~31 with (P, P/) distributed as in that result we have as n — > oo that 



fim(n),((n)(^J 



P; 



Rm(n),t(n) (^)-R m '(n)i(n) (4) 



DD' . 



(5.35) 



By (5.33), the random variables $R_{m(n),t(n)}(A)$, $n \ge 1$, are uniformly integrable, and so are the variables $R_{m(n),t(n)}(A)\, R'_{m'(n),t(n)}(A)$, $n \ge 1$. Hence we have the convergence of expectations corresponding to the convergence in distribution given by (5.35), i.e., as $n \to \infty$ we have that

$$E[R_{m(n),t(n)}(A)] \to E[D] = \frac{|A|}{|B_0|}\, E\delta_\infty(\mathcal{P}_\lambda);$$

$$E[R_{m(n),t(n)}(A)\, R'_{m'(n),t(n)}(A)] \to E[DD'] = \left(\frac{|A|}{|B_0|}\right)^{2} (E\delta_\infty(\mathcal{P}_\lambda))\, E\delta'_\infty(\mathcal{P}_\lambda),$$

and since the choice of $t(n)$, $m(n)$ and $m'(n)$ was arbitrary subject to $\lim_{n\to\infty} t(n) = \infty$ and to (5.34), this gives us (5.31) and (5.32). ■
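The uniform integrability step used in this proof can be spelled out as follows. This is a sketch of the standard argument rather than something stated in the source: $R_n$ and $R'_n$ are shorthand introduced here for $R_{m(n),t(n)}(A)$ and $R'_{m'(n),t(n)}(A)$, and $M$ is a hypothetical finite constant bounding the fourth moments for all large $n$, as (5.33) provides.

```latex
% Sketch. Shorthand (ours): R_n := R_{m(n),t(n)}(A), R'_n := R'_{m'(n),t(n)}(A).
% Assumption: E[R_n^4] \le M and E[(R'_n)^4] \le M for all large n, by (5.33).
\begin{align*}
E[R_n^2] &\le \big(E[R_n^4]\big)^{1/2} \le M^{1/2}, \\
E[(R_n R'_n)^2] &\le \big(E[R_n^4]\big)^{1/2}\big(E[(R'_n)^4]\big)^{1/2} \le M .
\end{align*}
% Both lines are the Cauchy-Schwarz inequality. A sequence that is bounded
% in L^2 is uniformly integrable, so convergence in distribution of R_n
% and of R_n R'_n upgrades to convergence of their expectations.
```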

Proof of Theorem 2.3. Assume $(t_n)_{n \ge 1}$ is a $(1,\infty)$-valued sequence satisfying (2.22), which says that $\lambda t_n^d |B_0| - n$ is $O(n^{1/2})$ as $n \to \infty$.

Assume the point processes $\mathcal{P}_{t_n}, \mathcal{U}_{1,t_n}, \mathcal{U}_{2,t_n}, \mathcal{U}_{3,t_n}, \ldots$ are coupled by having $\mathcal{U}_{m,t_n}$ defined by (5.28) and setting $\mathcal{P}_{t_n} = \{U_{1,t_n}, U_{2,t_n}, \ldots, U_{N_n,t_n}\}$ with $N_n$ an independent Poisson variable with mean $\mu_n := \mu_{\lambda,t_n} = \lambda t_n^d |B_0|$. For $1 \le j \le k$, let

$$\zeta_n^j := h^j(\mathcal{U}_{n,t_n}, t_n A_j); \qquad \zeta_n'^{\,j} := h^j(\mathcal{P}_{t_n}, t_n A_j).$$

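Define the $k$-vector

$$a := (\alpha_j)_{j=1}^k, \quad \text{with} \quad \alpha_j := \frac{|A_j|}{|B_0|}\, E[\delta^j_\infty(\mathcal{P}_\lambda)].$$

The first step is to prove that as $n \to \infty$,

$$E\left[ n^{-1} \left( \zeta_n'^{\,j} - \zeta_n^j - (N_n - n)\alpha_j \right)^2 \right] \to 0. \tag{5.36}$$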



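To prove this (writing $t(n)$ for $t_n$ when typographically convenient), note that the expectation in the left hand side is equal to

$$\sum_{m:\, |m - \mu_n| \le n^{3/4}} E\left[ n^{-1} \left( h^j(\mathcal{U}_{m,t(n)}, t_n A_j) - h^j(\mathcal{U}_{n,t(n)}, t_n A_j) - (m-n)\alpha_j \right)^2 \right] P[N_n = m]$$

$$+\; n^{-1} E\left[ \left( \zeta_n'^{\,j} - \zeta_n^j - (N_n - n)\alpha_j \right)^2 \mathbf{1}\{ |N_n - \mu_n| > n^{3/4} \} \right]. \tag{5.37}$$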



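Let $\varepsilon > 0$. By (5.29) and Lemma 5.4 there exists $c > 0$ such that for large enough $n$ and all $m$ with $n \le m \le \mu_n + n^{3/4}$,

$$E\left[ \left( h^j(\mathcal{U}_{m,t(n)}, t_n A_j) - h^j(\mathcal{U}_{n,t(n)}, t_n A_j) - (m-n)\alpha_j \right)^2 \right] = E\left[ \left( \sum_{\ell=n}^{m-1} \left( R_{\ell,t(n)}(A_j) - \alpha_j \right) \right)^2 \right] \le \varepsilon (m-n)^2 + c(m-n),$$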



where the bound comes from expanding out the double sum arising from the expectation of the squared sum; the $\varepsilon(m-n)^2$ term comes from bounding the off-diagonal terms using (5.31) and (5.32), and the $c(m-n)$ term comes from bounding the diagonal terms using (5.33) and the fact that bounded fourth moments imply bounded second moments. A similar argument applies when $\mu_n - n^{3/4} \le m < n$, and hence the first term in (5.37) is bounded by the expression

$$n^{-1} E\left[ \varepsilon (N_n - n)^2 + c|N_n - n| \right] = n^{-1} \left[ \varepsilon \left( E[(N_n - \mu_n)^2] + (\mu_n - n)^2 \right) + c\, E[|N_n - n|] \right].$$
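For reference, the expansion behind the bound $\varepsilon(m-n)^2 + c(m-n)$ can be written out as below. This is a sketch under stated assumptions: $X_\ell$ is shorthand introduced here, and the two termwise bounds are assumed to hold uniformly over the relevant indices for large $n$, which is what Lemma 5.4 is used for.

```latex
% Shorthand (ours): X_\ell := R_{\ell,t(n)}(A_j) - \alpha_j for n \le \ell < m.
% Assumptions: E[X_\ell^2] \le c, and |E[X_\ell X_{\ell'}]| \le \varepsilon
% whenever \ell \ne \ell'. The second holds for large n because
%   E[X_\ell X_{\ell'}] = E[R_\ell R_{\ell'}] - \alpha_j E[R_\ell]
%                         - \alpha_j E[R_{\ell'}] + \alpha_j^2 \to 0
% by (5.31) and (5.32) applied with h = h' = h^j.
\begin{align*}
E\Big[\Big(\sum_{\ell=n}^{m-1} X_\ell\Big)^{2}\Big]
  &= \sum_{\ell=n}^{m-1} E[X_\ell^2]
     + \sum_{\ell \ne \ell'} E[X_\ell X_{\ell'}] \\
  &\le c\,(m-n) + \varepsilon\,(m-n)(m-n-1) \\
  &\le c\,(m-n) + \varepsilon\,(m-n)^2 .
\end{align*}
```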

By assumption (2.22), we have that $\mu_n \sim n$ and $d := \limsup\, (\mu_n - n)^2/n < \infty$, so that for large enough $n$ the first term in (5.37) is bounded by $(3 + d)\varepsilon$.
By the uniform bound (2.17) and the Cauchy-Schwarz inequality, there is a constant $\beta_3$ such that the second term in (5.37) is bounded by $\beta_3 n^{\beta_3} (P[|N_n - \mu_n| > n^{3/4}])^{1/2}$, which tends to zero, e.g. by Lemma 1.4 of [15]. Since $\varepsilon$ is arbitrary and does not depend on $c'$, this completes the proof of (5.36).
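To get a feel for why a polynomial factor times the square root of the Poisson tail probability vanishes, the tail can be computed numerically. The sketch below is illustrative only: it assumes $\mu_n = n$ exactly (the source only requires $\mu_n - n = O(n^{1/2})$), and the function names and the choice of exponent are hypothetical.

```python
import math


def poisson_log_pmf(k: int, mu: float) -> float:
    """log P[N = k] for N ~ Poisson(mu), computed via lgamma for stability."""
    return k * math.log(mu) - mu - math.lgamma(k + 1)


def poisson_two_sided_tail(mu: float, dev: float) -> float:
    """P[|N - mu| > dev] for N ~ Poisson(mu), summing the pmf over both tails."""
    # Lower tail: integers k with k < mu - dev (empty if mu - dev <= 0).
    lower_end = math.ceil(mu - dev) - 1
    lower = sum(math.exp(poisson_log_pmf(k, mu)) for k in range(0, lower_end + 1))
    # Upper tail: integers k with k > mu + dev, truncated far beyond the bulk,
    # where the remaining mass is negligible.
    upper_start = math.floor(mu + dev) + 1
    cutoff = upper_start + int(50 * math.sqrt(mu)) + 50
    upper = sum(math.exp(poisson_log_pmf(k, mu)) for k in range(upper_start, cutoff))
    return lower + upper


def second_term_proxy(n: int, power: float) -> float:
    """n**power * sqrt(P[|N_n - n| > n**0.75]) with N_n ~ Poisson(n) (assumed mean n)."""
    return n ** power * math.sqrt(poisson_two_sided_tail(float(n), n ** 0.75))


if __name__ == "__main__":
    for n in (100, 1_000, 10_000):
        print(n, poisson_two_sided_tail(float(n), n ** 0.75), second_term_proxy(n, 1.0))
```

The tails are summed directly, rather than computed as one minus the central mass, so that very small probabilities are not lost to floating-point cancellation.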

Let $b_1, \ldots, b_k$ be arbitrary real constants. Define the column vector $b := (b_1, \ldots, b_k)'$.

Let

$$\zeta_n := \sum_{j=1}^{k} b_j \zeta_n^j; \qquad \zeta'_n := \sum_{j=1}^{k} b_j \zeta_n'^{\,j}.$$

We prove convergence of $n^{-1}\mathrm{Var}(\zeta_n)$, using the identity

$$n^{-1/2}\zeta'_n = n^{-1/2}\zeta_n + n^{-1/2}(N_n - n)a'b + n^{-1/2}\left(\zeta'_n - \zeta_n - (N_n - n)a'b\right).$$

In the right hand side, the third term has variance tending to zero by (5.36), while the second term has variance tending to $(a'b)^2$ and is independent of the first term. It follows that with the matrix $\Sigma^{\lambda} = (\sigma_{ij})_{i,j=1}^{k}$ given by Theorem 2.2 and the matrix $\Sigma^{\lambda,A}$ given by

$$\cdots\ \frac{|A_i \cap A_j|}{|B_0|},$$

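we have from Theorem 2.2 that

$$b'\Sigma^{\lambda,A}b = \lim_{n\to\infty} n^{-1}\mathrm{Var}(\zeta'_n) = \lim_{n\to\infty} \left(n^{-1}\mathrm{Var}(\zeta_n)\right) + (a'b)^2,$$

so that $\Sigma^{\lambda,A} - aa'$ is nonnegative definite and $n^{-1}\mathrm{Var}(\zeta_n) \to b'(\Sigma^{\lambda,A} - aa')b$. This gives us (2.23).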
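As a quick sanity check on the variance correction $-aa'$, consider a hypothetical example that does not appear in the source: a single functional that simply counts points. The setup below assumes $k = 1$, $b = 1$ and $\mu_n = n$ exactly, and reads $a$ as $|A|/|B_0|$ times the expected add-one cost.

```latex
% Hypothetical example (ours): h(\mathcal{X}, A) := \mathrm{card}(\mathcal{X} \cap A),
% with k = 1, b = 1, and \mu_n = n. Write p := |A| / |B_0|.
% Adding one point inside A raises h by exactly 1, so the add-one cost
% has limit 1 and a = p.
\begin{align*}
\zeta'_n \sim \mathrm{Poisson}(np)
  &\implies n^{-1}\mathrm{Var}(\zeta'_n) = p, \\
\zeta_n \sim \mathrm{Binomial}(n, p)
  &\implies n^{-1}\mathrm{Var}(\zeta_n) = p(1-p) = p - a^2 .
\end{align*}
% The binomial variance is the Poisson variance minus a^2, matching the
% general form b'(\Sigma^{\lambda,A} - aa')b in this special case.
```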

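The proof of Theorem 2.2 (since it is derived by taking linear combinations) tells us that $n^{-1/2}(\zeta'_n - E\zeta'_n) \xrightarrow{\mathcal{D}} \mathcal{N}(0, b'\Sigma^{\lambda,A}b)$. Combined with (5.36) this gives us

$$n^{-1/2}\left(\zeta_n - E\zeta'_n + (N_n - n)a'b\right) \xrightarrow{\mathcal{D}} \mathcal{N}(0, b'\Sigma^{\lambda,A}b). \tag{5.38}$$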

Recall that $\mu_n := \lambda t_n^d |B_0| = E N_n$. Since $n^{-1/2}(N_n - \mu_n)a'b$ is independent of $\zeta_n$ and is asymptotically normal with mean zero and variance $(a'b)^2 = b'aa'b$, we can deduce from (5.38), by considering characteristic functions, that

$$n^{-1/2}\left(\zeta_n - E\zeta'_n + (\mu_n - n)a'b\right) \xrightarrow{\mathcal{D}} \mathcal{N}(0, b'(\Sigma^{\lambda,A} - aa')b). \tag{5.39}$$

By (5.36), the expectation of $n^{-1/2}(\zeta'_n - \zeta_n - (N_n - n)a'b)$ tends to zero, so in (5.39) we can replace $-E\zeta'_n + (\mu_n - n)a'b$ by $-E\zeta_n$, which gives us

$$n^{-1/2}\left(\zeta_n - E\zeta_n\right) \xrightarrow{\mathcal{D}} \mathcal{N}(0, b'(\Sigma^{\lambda,A} - aa')b).$$

Then (2.24) follows by the Cramér-Wold device. ■
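The characteristic-function step that leads from (5.38) to (5.39) in the proof above can be sketched as follows. The symbols $X_n$, $Y_n$, $\sigma^2$ and $\tau^2$ are shorthand introduced here for illustration, not notation from the source.

```latex
% Shorthand (ours):
%   X_n := n^{-1/2}(\zeta_n - E\zeta'_n + (\mu_n - n)a'b),
%   Y_n := n^{-1/2}(N_n - \mu_n)a'b,
%   \sigma^2 := b'\Sigma^{\lambda,A}b, \qquad \tau^2 := (a'b)^2 .
% X_n and Y_n are independent, and X_n + Y_n is the left side of (5.38).
\begin{align*}
E\big[e^{is(X_n+Y_n)}\big] &= E\big[e^{isX_n}\big]\,E\big[e^{isY_n}\big], \\
E\big[e^{is(X_n+Y_n)}\big] &\to e^{-s^2\sigma^2/2}, \qquad
E\big[e^{isY_n}\big] \to e^{-s^2\tau^2/2} \ne 0, \\
\text{hence}\quad E\big[e^{isX_n}\big] &\to e^{-s^2(\sigma^2-\tau^2)/2}
  = \exp\!\Big(-\tfrac{s^2}{2}\, b'(\Sigma^{\lambda,A} - aa')b\Big).
\end{align*}
% The limit is the characteristic function of N(0, b'(\Sigma^{\lambda,A} - aa')b),
% which is the statement (5.39).
```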

Lemma 5.5 Let $\lambda > 0$. Suppose the graph $G := G(\mathcal{X})$, defined for each locally finite $\mathcal{X} \subset \mathbb{R}^d$, is translation invariant and satisfies the stabilization conditions (2.28) and (2.29). Then with probability 1, for each $X \in \mathcal{P}_\lambda$ there exists $R(X) < \infty$ such that the set of edges of $G(\mathcal{P}^0_\lambda)$ incident to $X$ is unaffected by changes to $\mathcal{P}_\lambda$ outside $B_{R(X)}(X)$, and for each $X \in \mathcal{P}_\lambda$ there exists $R'(X) < \infty$ such that the set of edges of $G(\mathcal{P}_\lambda)$ incident to $X$ is unaffected by changes to $\mathcal{P}_\lambda$ outside $B_{R'(X)}(X)$.

Proof. The existence of finite $R(X)$ for all $X \in \mathcal{P}_\lambda$ is given by Lemma 3.3 of [19]. The existence of finite $R'(X)$ for all $X \in \mathcal{P}_\lambda$ is proved in the course of the proof of Lemma 3.3 of [19]. ■

Proof of Lemma IF71 First set h(X,A) = L%{X,A), as defined at ijZTffljl . Let the 
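random variable $R$ satisfy (2.28) and (2.29). Let $A \in \mathcal{R}(\mathbb{R}^d)$, and let $\mathcal{A} \subset \mathbb{R}^d \setminus B_R(0)$ be finite. Then if $B_R(0) \subseteq A$, the increment $\delta(A, (\mathcal{P}_\lambda \cap B_R(0)) \cup \mathcal{A})$ is equal to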

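$$\Big( \sum_{e \in \mathcal{E}^+(0;\, \mathcal{P}_\lambda \cap B_R(0))} \phi(|e|) \Big) - \sum_{e \in \mathcal{E}^-(0;\, \mathcal{P}_\lambda \cap B_R(0))} \phi(|e|), \tag{5.40}$$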

since all added and removed edges have both endpoints in $A$. Hence (2.13) holds with $\delta_\infty(\mathcal{P}_\lambda)$ equal to the expression displayed in (5.40). If instead $A \cap B_R(0) = \emptyset$, then $\delta(A, (\mathcal{P}_\lambda \cap B_R(0)) \cup \mathcal{A}) = 0$ since added and removed edges have neither endpoint in $A$. Hence, (2.14) holds.

Next, suppose we set $h(\mathcal{X}, A) = V^{\psi}(\mathcal{X}, A)$, where $\psi \in B(\mathcal{K}_\kappa)$, with $\kappa \in \mathbb{N}$. We assert that there exists an almost surely finite random variable $R$ such that (2.28) and (2.29) hold, and such that for every vertex $X$ of $G(\mathcal{P}_\lambda \cap B_R(0))$ at a graph distance at most $2\kappa$ from some endpoint of some edge in either $\mathcal{E}^+(0; \mathcal{P}_\lambda \cap B_R(0))$ or $\mathcal{E}^-(0; \mathcal{P}_\lambda \cap B_R(0))$, the set of edges incident to $X$ is unaffected by changes outside $B_R(0)$. The existence of such an $R$ follows from Lemma 5.5 along with an inductive argument in $\kappa$.

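Let $\mathcal{A} \subset \mathbb{R}^d \setminus B_R(0)$ be finite. Suppose $X \in (\mathcal{P}_\lambda \cap B_R(0)) \cup \mathcal{A}$ lies at a graph distance more than $\kappa$ in $G((\mathcal{P}_\lambda \cap B_R(0)) \cup \mathcal{A})$ from any endpoint of any edge in either $\mathcal{E}^+(0; (\mathcal{P}^0_\lambda \cap B_R(0)) \cup \mathcal{A})$ or $\mathcal{E}^-(0; (\mathcal{P}^0_\lambda \cap B_R(0)) \cup \mathcal{A})$ (all vertices in $\mathcal{A}$ fall in this category). Then

$$\psi\big(G_{X,\kappa}[(\mathcal{P}^0_\lambda \cap B_R(0)) \cup \mathcal{A}]\big) = \psi\big(G_{X,\kappa}[(\mathcal{P}_\lambda \cap B_R(0)) \cup \mathcal{A}]\big).$$

Also, for the remaining $X \in \mathcal{P}_\lambda \cap B_R(0)$, at a graph distance at most $\kappa$ from the endpoint of some edge in either $\mathcal{E}^+(0; (\mathcal{P}^0_\lambda \cap B_R(0)) \cup \mathcal{A})$ or $\mathcal{E}^-(0; (\mathcal{P}^0_\lambda \cap B_R(0)) \cup \mathcal{A})$, the value of $\psi(G_{X,\kappa}((\mathcal{P}^0_\lambda \cap B_R(0)) \cup \mathcal{A})) - \psi(G_{X,\kappa}((\mathcal{P}_\lambda \cap B_R(0)) \cup \mathcal{A}))$ is unaffected by changes to the set $\mathcal{A}$ outside $B_R(0)$. The conditions (2.13) and (2.14) (with $S = R$) follow for this case.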

Suppose now that uniqueness of the infinite component holds, and set $h(\mathcal{X}, A) = K_G(\mathcal{X}, A)$. The stabilization conditions are proved, essentially by a slight modifica-
tion of the proof of Proposition 6.1 of ^H]- For the convenience of the reader, we 
describe the argument in the present, more general context. 

Let $R$ be a radius of stabilization, as given at (2.28) and (2.29). Choose a finite $R' > R$ such that for any two points of $\mathcal{P}$ in $B_R(0)$, either they can be connected by a path in $G(\mathcal{P})$ all of whose nodes lie in $B_{R'}(0)$, or at least one of them lies in a finite component contained in $B_{R'}(0)$, and such that a similar statement holds for $\mathcal{P}^0$. The proof that we can choose such an $R'$ is based on the uniqueness of the infinite component in $G(\mathcal{P})$ and $G(\mathcal{P}^0)$, and is given in more detail in [15].

By Lemma 3.3 of [19], there almost surely exists $R'' > R'$ such that for all $X \in \mathcal{P} \cap B_{R'}(0)$, the set of edges incident to $X$ in $G(\mathcal{P}^0)$ is unaffected by additions or deletions of points outside $B_{R''}(0)$, and moreover, by the proof of Lemma 3.3 of [19], we can choose $R''$ to be so large that in addition, the set of edges incident to $X$ in $G(\mathcal{P})$ is unaffected by additions or deletions of points outside $B_{R''}(0)$.

Suppose that $A \in \mathcal{R}(\mathbb{R}^d)$ and $B_{R''}(0) \subseteq A$. Suppose $\mathcal{A}$ is disjoint from $B_{R''}(0)$. When we change from $G((\mathcal{P} \cap B_{R''}(0)) \cup \mathcal{A})$ to $G((\mathcal{P}^0 \cap B_{R''}(0)) \cup \mathcal{A})$, the effect is first to add a vertex at the origin, then to add the edges of $\mathcal{E}^+(0; \mathcal{P}^0)$, and then to remove the edges of $\mathcal{E}^-(0; \mathcal{P}^0)$. Consider adding successive edges, in some specified order. Each edge reduces the number of components that intersect $A$ by 1 if it joins two points that were previously not connected by a path, and otherwise does not affect the number of components. The question of whether a particular added edge changes the number of components is determined by the graph structure of the restriction of $G(\mathcal{P})$ to vertices in $B_{R'}(0)$, and therefore does not depend on $A$ or $\mathcal{A}$ (always presuming $A \in \mathcal{R}$ and $B_{R''}(0) \subseteq A$). A similar argument applies with deleted edges.
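The edge-by-edge bookkeeping in the preceding paragraph can be illustrated with a union-find structure. The sketch below is a toy illustration, not the construction in the source: it counts all components of a finite graph rather than only those intersecting a region $A$, and the function name and the integer vertex labels are hypothetical.

```python
def count_components_after_edges(num_vertices, edges):
    """Add edges one at a time; return the component count after each addition.

    Vertices are labelled 0 .. num_vertices - 1 and start as isolated points.
    An edge lowers the count by 1 exactly when its endpoints were not
    previously connected by a path; otherwise the count is unchanged.
    """
    parent = list(range(num_vertices))

    def find(v):
        # Follow parent pointers to the root, halving the path on the way.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    components = num_vertices
    history = []
    for u, v in edges:
        root_u, root_v = find(u), find(v)
        if root_u != root_v:
            parent[root_u] = root_v   # merge two previously separate components
            components -= 1
        history.append(components)
    return history


if __name__ == "__main__":
    # The third edge closes a cycle, so it leaves the count unchanged.
    print(count_components_after_edges(4, [(0, 1), (1, 2), (0, 2), (2, 3)]))
```

Whether a given edge merges two components depends only on the connectivity already present among the vertices it touches, which mirrors the locality used in the argument.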

It follows from the above that if we set $\delta_\infty(\mathcal{P}_\lambda) := \delta(B_{R''}(0), \mathcal{P} \cap B_{R''}(0))$ and $S = R''$, then (2.13) holds.

Now suppose that $A \cap B_{R''}(0) = \emptyset$ (and $\mathcal{A}$ is also disjoint from $B_{R''}(0)$ as before). Consider again the process of successive additions and deletions described above. If an added edge connects two previously disconnected components, then at least one of them has a vertex set entirely contained in $B_{R'}(0)$, and therefore does not have any vertices in $A$, and so this change does not cause any increment in the number of components that have at least one vertex in $A$. A similar argument applies with removed edges; hence, if $B_{R''}(0) \cap A = \emptyset$ we have $\delta(A, (\mathcal{P} \cap B_{R''}(0)) \cup \mathcal{A}) = 0$, so that (2.14) holds. ■